Pull requests: Dao-AILab/flash-attention
[BugFix] Fix flash_attn_with_kvcache with scalar cache_seqlen (#1795, opened Aug 5, 2025 by stepinto); see the call sketch after this list
Fix race condition bug in cute _flash_attn_fwd in multi-GPU environments (#1793, opened Aug 1, 2025 by beiw-nv)
Add torch.compile support to flash attention 3 (#1769, opened Jul 22, 2025 by guilhermeleobas)
Enable the deterministic mode option in the backward kernel (#1766, opened Jul 21, 2025 by GD06)
Fix illegal memory access through off-by-one error in num_splits_dynamic_ptr init (#1747, opened Jul 10, 2025 by klondenberg-bioptimus)
Useful command to install flash-attention faster on behemoth clusters (#1660, opened May 10, 2025 by sleepingcat4)
Patch RPATH of compiled Linux library to locate PyTorch and CUDA libraries in virtual env (#1634, opened Apr 30, 2025 by sisp)
feat: support tiling K and V separately in FA3 backward (#1626, opened Apr 28, 2025 by beginlner)
Add checks for zero-element input in the Triton LayerNorm impl (#1621, opened Apr 27, 2025 by Luciennnnnnn)
Add PT compilable support for flash_attn_with_kvcache (#1592, opened Apr 14, 2025 by jataylo)
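Two of the entries above (#1795 and #1592) concern flash_attn_with_kvcache, whose cache_seqlens argument accepts either a per-batch int32 tensor or a single Python int. Below is a minimal, hedged sketch of the two call forms; the batch size, head count, head dimension, and cache lengths are illustrative assumptions, not values taken from either PR.

```python
# Sketch: calling flash_attn_with_kvcache with a per-batch tensor of cache lengths
# vs. a single scalar (the case PR #1795 reports as broken). All sizes below are
# assumptions for illustration only.
import torch
from flash_attn import flash_attn_with_kvcache

batch, nheads, headdim = 2, 8, 64
seqlen_new, seqlen_cache = 1, 128

q = torch.randn(batch, seqlen_new, nheads, headdim, device="cuda", dtype=torch.float16)
k_new = torch.randn_like(q)
v_new = torch.randn_like(q)
k_cache = torch.zeros(batch, seqlen_cache, nheads, headdim, device="cuda", dtype=torch.float16)
v_cache = torch.zeros_like(k_cache)

# Tensor form: one current cache length per batch element.
cache_seqlens = torch.full((batch,), 16, dtype=torch.int32, device="cuda")
out = flash_attn_with_kvcache(q, k_cache, v_cache, k=k_new, v=v_new,
                              cache_seqlens=cache_seqlens, causal=True)

# Scalar form: the API also accepts a plain int, which is the path #1795 targets.
out = flash_attn_with_kvcache(q, k_cache, v_cache, k=k_new, v=v_new,
                              cache_seqlens=16, causal=True)
```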