Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[V0 deprecation] Remove _set_default_args_v0 function
#25409 opened Sep 22, 2025 by Isotr0py Loading…
1 of 5 tasks
Roll back uniform decode with mixed batch cudagraph v1
#25407 opened Sep 22, 2025 by MatthewBonanni Loading…
5 tasks
[Speculators][Speculative Decoding] Fix gpt-oss eagle3 accuracy issue bug Something isn't working gpt-oss Related to GPT-OSS models llama Related to Llama models speculative-decoding v1
#25406 opened Sep 22, 2025 by jiahanc Loading…
5 tasks
[Core] Optimize LoRA weight loading ready ONLY add when PR is ready to merge/full CI is needed
#25403 opened Sep 22, 2025 by jeejeelee Loading…
5 tasks
[V1] Remove V0 code paths for Hybrid models v1
#25400 opened Sep 22, 2025 by tdoublep Draft
5 tasks
Add H100 fused MoE config
#25398 opened Sep 22, 2025 by skyloevil Loading…
[CI Failure] Fix fp8 kv cache on <SM90 ci-failure Issue about an unexpected test failure in CI ready ONLY add when PR is ready to merge/full CI is needed
#25396 opened Sep 22, 2025 by mgoin Loading…
5 tasks
[CI/Build] Skip Qwen3-VL initialization tests until models are actually released qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#25394 opened Sep 22, 2025 by DarkLight1337 Loading…
5 tasks
[Compiler] Disable Inductor standalone compile by default ready ONLY add when PR is ready to merge/full CI is needed
#25391 opened Sep 22, 2025 by ElizaWszola Loading… v0.11.0
[Ray][CPU] Ray executor and Ray DP support for CPU backend ci/build documentation Improvements or additions to documentation v1
#25386 opened Sep 22, 2025 by alex-coniasse Loading…
5 tasks
[feat] Support MRoPE + YaRN
#25384 opened Sep 22, 2025 by JJJYmmm Loading…
[BugFix][ModelRunner] properly handle unused buffer v1
#25380 opened Sep 22, 2025 by yma11 Loading…
[V0 Deprecation][KVConnector] Remove KVConnector v1/v0 differentiation ci/build documentation Improvements or additions to documentation kv-connector ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs v1
#25376 opened Sep 22, 2025 by NickLucche Loading…
[Model] Support multi-vector retrieval documentation Improvements or additions to documentation qwen Related to Qwen models
#25370 opened Sep 22, 2025 by noooop Draft
5 tasks
[Docs] Fix griffe warnings in vllm/lora/ops
#25369 opened Sep 22, 2025 by windsonsea Loading…
[Docs] wheel larger than limit documentation Improvements or additions to documentation
#25367 opened Sep 22, 2025 by pfk-beta Loading…
[Bugfix] Qwen3-next generate ! always qwen Related to Qwen models
#25365 opened Sep 22, 2025 by yych0745 Loading…
5 tasks
[Core] Enable KV cache connector + hybrid allocator kv-connector tpu Related to Google TPUs v1
#25363 opened Sep 22, 2025 by KuntaiDu Loading…
5 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.