-
Notifications
You must be signed in to change notification settings - Fork 39
Pull requests: HabanaAI/vllm-hpu-extension
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix the warmup issue of batch expansion for Deepseek MTP
#343
opened Aug 21, 2025 by
YuJiankang
Loading…
Enable calibration using pile-10k dataset for DeepSeek models
#279
opened Jul 14, 2025 by
yangulei
Loading…
Allow usage of fused_block_softmax_adjustment for Qwen with Lazy
#246
opened Jun 27, 2025 by
mswiniarsk
•
Draft
[SW-225565] Enable triangular softmax with merged prefill
#197
opened May 26, 2025 by
kamil-kaczor
•
Draft
Add renormalize parameter for FusedMOE's & modify experts_max arg of mixture_of_experts()
#70
opened Jan 9, 2025 by
tangleintel
•
Draft
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.