-
Notifications
You must be signed in to change notification settings - Fork 214
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Provide distributed version of improved TopK kernel
#1710
opened Aug 27, 2025 by
hariharans29
Loading…
[DO NOT MERGE] Improve TopK CUDA execution path while sampling and when vocab size is sufficiently large
#1705
opened Aug 26, 2025 by
hariharans29
Loading…
Add support for inference using EP context model
#1691
opened Aug 19, 2025 by
thevishalagarwal
Loading…
Modify Model Builder to build paged attention models
#1605
opened Jul 3, 2025 by
aciddelgado
•
Draft
add extra_options use_channel_wised_quantization to builder.py
#1362
opened Mar 31, 2025 by
bopeng1234
Loading…
Make Microsoft.ML.OnnxRuntimeGenAI.Tokenizer a Microsoft.ML.Tokenizers.Tokenizer
#970
opened Oct 11, 2024 by
stephentoub
Loading…
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.