Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

ci: skip GB200-4_GPUs-PyTorch-1 test stage
#6377 opened Jul 26, 2025 by QiJune Loading…
[infra] Add an auto-labeling github action to TRTLLM
#6373 opened Jul 25, 2025 by poweiw Loading…
Downgrade CUBLAS to 12.9.0.13-1
#6372 opened Jul 25, 2025 by yuanjingx87 Loading…
feat: Add LoRA support for Gemma3
#6371 opened Jul 25, 2025 by brb-nv Loading…
[None][infra]Update slurm config keys
#6370 opened Jul 25, 2025 by yuanjingx87 Loading…
Draft: Pytorch + disagg + pp
#6369 opened Jul 25, 2025 by pcastonguay Loading…
chore: Improve the AutoTuner log information.
#6368 opened Jul 25, 2025 by hyukn Loading…
chore: disallow arbitrary in llm_args.Configs
#6367 opened Jul 25, 2025 by Superjomn Loading…
test: skip post blackwell
#6357 opened Jul 25, 2025 by xinhe-nv Draft
doc: Add README for wide EP
#6356 opened Jul 25, 2025 by kaiyux Loading…
fix: Fix max attn window in TRTLLM Sampler.
#6354 opened Jul 25, 2025 by dcampora Loading…
feat: Pytorch-backend Phi4MM model update
#6353 opened Jul 25, 2025 by Wanli-Jiang Loading…
[test] Unwaive mistral3.1 small E2E test
#6352 opened Jul 25, 2025 by 2ez4bz Loading…
chore: remove unused code in PyExecutor
#6351 opened Jul 25, 2025 by QiJune Loading…
Refactor dataTransciever classes
#6348 opened Jul 25, 2025 by Tabrizian Loading…
[feat] Add long data collection dataset support
#6347 opened Jul 25, 2025 by yweng0828 Loading…
ProTip! Follow long discussions with comments:>50.