NVIDIA / TensorRT-LLM Public

Notifications You must be signed in to change notification settings
Fork 1.6k
Star 11.1k

Code
Issues 711
Pull requests 362
Discussions
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: NVIDIA/TensorRT-LLM

Labels 44 Milestones 1

New pull request New

362 Open 3,073 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[TRTLLM-6674][Breaking Change] Hopper SWA non-cyclic kernels + KV reuse + Spec Dec

#6379 opened Jul 26, 2025 by symphonylyh

Loading…

1 task

ci: skip GB200-4_GPUs-PyTorch-1 test stage

#6377 opened Jul 26, 2025 by QiJune

Loading…

Publish N-Gram tech blog in README.

#6376 opened Jul 26, 2025 by SimengLiu-nv • Draft

[infra] Add an auto-labeling github action to TRTLLM

#6373 opened Jul 25, 2025 by poweiw

Loading…

Downgrade CUBLAS to 12.9.0.13-1

#6372 opened Jul 25, 2025 by yuanjingx87

Loading…

feat: Add LoRA support for Gemma3

#6371 opened Jul 25, 2025 by brb-nv

Loading…

[None][infra]Update slurm config keys

#6370 opened Jul 25, 2025 by yuanjingx87

Loading…

Draft: Pytorch + disagg + pp

#6369 opened Jul 25, 2025 by pcastonguay

Loading…

chore: Improve the AutoTuner log information.

#6368 opened Jul 25, 2025 by hyukn

Loading…

chore: disallow arbitrary in llm_args.Configs

#6367 opened Jul 25, 2025 by Superjomn

Loading…

chore: add _prepare_and_schedule_batch function in PyExecutor

#6365 opened Jul 25, 2025 by QiJune

Loading…

[TRTLLM-6392][feat] Support turning on/off spec decoding dynamically Community want to contribute

PRs initiated from Community

#6363 opened Jul 25, 2025 by ziyixiong-nv

Loading…

Add C++ RequestSpecificException

#6362 opened Jul 25, 2025 by Shunkangz • Draft

[nvbug/5320234] fix: test_trtllm_bench_llmapi_launch

#6359 opened Jul 25, 2025 by Superjomn

Loading…

test: skip post blackwell

#6357 opened Jul 25, 2025 by xinhe-nv • Draft

doc: Add README for wide EP

#6356 opened Jul 25, 2025 by kaiyux

Loading…

[https://nvbugs/5340941][https://nvbugs/5375785] - fix: Wrap attentio…

#6355 opened Jul 25, 2025 by liji-nv

Loading…

fix: Fix max attn window in TRTLLM Sampler.

#6354 opened Jul 25, 2025 by dcampora

Loading…

feat: Pytorch-backend Phi4MM model update

#6353 opened Jul 25, 2025 by Wanli-Jiang

Loading…

[test] Unwaive mistral3.1 small E2E test

#6352 opened Jul 25, 2025 by 2ez4bz

Loading…

chore: remove unused code in PyExecutor

#6351 opened Jul 25, 2025 by QiJune

Loading…

chore: add warning for the default backend on serve and bench commands

#6350 opened Jul 25, 2025 by Superjomn

Loading…

Add disable_optimistic_tuning flag and update gb_per_token calculation title

#6349 opened Jul 25, 2025 by venkywonka

Loading…

Refactor dataTransciever classes

#6348 opened Jul 25, 2025 by Tabrizian

Loading…

[feat] Add long data collection dataset support

#6347 opened Jul 25, 2025 by yweng0828

Loading…

Previous 1 2 3 4 5 … 14 15 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!