Support for OpenAI's gpt-oss open-weight models #1674
-
It is already supported through ONNX Runtime GenAI. Please check out the Hugging Face repo here for more information.
-
Thanks @kunal-vaishnavi for opening the PR. I followed the steps: I built the repo from source from your branch, used the nightly version of onnxruntime, and ran:
The output model.onnx.data was 40 GB. Isn't that huge?
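As a rough sanity check on that file size, here is a back-of-the-envelope sketch. It is only an estimate under stated assumptions: the thread does not say which gpt-oss variant or which precision was used, so the parameter count (about 21 billion, roughly the size of the smaller variant) and the bytes-per-weight values below are assumptions, and `weights_size_gb` is a hypothetical helper, not part of any library.

```python
def weights_size_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate size of the raw weights in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumption: about 21 billion parameters. The thread does not state the model variant.
ASSUMED_PARAMS = 21e9

# Assumption: common storage precisions and their bytes per weight.
PRECISIONS = [("fp32", 4.0), ("fp16/bf16", 2.0), ("int8", 1.0), ("int4", 0.5)]

for label, nbytes in PRECISIONS:
    size = weights_size_gb(ASSUMED_PARAMS, nbytes)
    print(f"{label:>9}: about {size:.1f} GB")
```

Under these assumptions, 16-bit weights alone come to about 42 GB, so a model.onnx.data file of around 40 GB would be in the expected range for an unquantized 16-bit export; a 4-bit quantized export would be several times smaller.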
-
The PR to create your own ONNX models for OpenAI's gpt-oss has now been merged. The kernel changes needed to support running the CPU and WebGPU variants are in progress inside ONNX Runtime.
-
Please support https://openai.com/open-models/