-
Notifications
You must be signed in to change notification settings - Fork 386
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl
CLA Signed
This label is managed by the Meta Open Source bot.
#1276
opened Jun 8, 2025 by
lessw2020
Loading…
run core tests only if core files are modified
CLA Signed
This label is managed by the Meta Open Source bot.
#1275
opened Jun 8, 2025 by
tianyu-l
Loading…
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm
CLA Signed
This label is managed by the Meta Open Source bot.
#1274
opened Jun 8, 2025 by
lessw2020
Loading…
[SimpleFSDP] Add support for SimpleFSDP DCP
CLA Signed
This label is managed by the Meta Open Source bot.
#1273
opened Jun 8, 2025 by
ruisizhang123
Loading…
[deepseek][blackwell] add manual looping group gemm to enable base working inference on Blackwell
CLA Signed
This label is managed by the Meta Open Source bot.
#1272
opened Jun 7, 2025 by
lessw2020
Loading…
add logging to keep track of internal usage
CLA Signed
This label is managed by the Meta Open Source bot.
#1271
opened Jun 6, 2025 by
tianyu-l
Loading…
set up action to publish to PyPI on release
CLA Signed
This label is managed by the Meta Open Source bot.
#1270
opened Jun 6, 2025 by
tianyu-l
Loading…
[llama4] enable expert parallel on the same device mesh as tp (tp2ep)
CLA Signed
This label is managed by the Meta Open Source bot.
#1269
opened Jun 6, 2025 by
hann-wang
Loading…
Fix lr scheduler
CLA Signed
This label is managed by the Meta Open Source bot.
release blocking
Issues that are blocking the milestone / release completion
Added support for creating ROCm docker image for torchtian & run torchtitan tests on ROCm.
CLA Signed
This label is managed by the Meta Open Source bot.
module: rocm
#1260
opened Jun 4, 2025 by
akashveramd
•
Draft
[WIP][Blackwell Kernels] Blackwell group gemm and dense gemms with Python Cutlass
CLA Signed
This label is managed by the Meta Open Source bot.
#1256
opened Jun 3, 2025 by
lessw2020
Loading…
alternative implementation of create_indices_from_offsets_nosync compatible with torch.compile
CLA Signed
This label is managed by the Meta Open Source bot.
#1251
opened Jun 1, 2025 by
hann-wang
Loading…
[float8] add float8 rowwise MoE prototype
CLA Signed
This label is managed by the Meta Open Source bot.
#1245
opened May 30, 2025 by
danielvegamyhre
•
Draft
Implement initial_load_path for checkpointer
CLA Signed
This label is managed by the Meta Open Source bot.
release blocking
Issues that are blocking the milestone / release completion
[cp][flex_attention] integration test trial
CLA Signed
This label is managed by the Meta Open Source bot.
[Flux] Add batched inference
CLA Signed
This label is managed by the Meta Open Source bot.
#1227
opened May 27, 2025 by
CarlosGomes98
Loading…
[WIP] Implement the feature to save unsharded weights at the last step
CLA Signed
This label is managed by the Meta Open Source bot.
#1219
opened May 23, 2025 by
fegin
Loading…
[WIP][Experimental] Activation Offloading
CLA Signed
This label is managed by the Meta Open Source bot.
#1218
opened May 23, 2025 by
lessw2020
Loading…
[WIP][DeepSeek] DeepSeek training and component integration with Titan main components
CLA Signed
This label is managed by the Meta Open Source bot.
#1183
opened May 13, 2025 by
lessw2020
Loading…
compile: turn off fullgraph=True to support llama4
CLA Signed
This label is managed by the Meta Open Source bot.
#1182
opened May 12, 2025 by
bdhirsh
Loading…
[cp][flex_attention] integration test trial
CLA Signed
This label is managed by the Meta Open Source bot.
module: context parallel
[WIP] float8 rowwise all gather
CLA Signed
This label is managed by the Meta Open Source bot.
#1157
opened Apr 30, 2025 by
danielvegamyhre
•
Draft
[WIP] token-expert assignments and layer affinity tracking for expert placement via ILP solving
CLA Signed
This label is managed by the Meta Open Source bot.
#1152
opened Apr 28, 2025 by
lessw2020
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.