
Conversation

@hiyouga hiyouga (Owner) commented Aug 5, 2025

Apply LoRA fine-tuning to the GPT-OSS model in 3 steps

1. Install LLaMA-Factory and transformers

git clone --depth 1 https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory
pip install -e ".[torch,metrics]" --no-build-isolation
pip install "transformers==4.55.0"

2. Train GPT-OSS on a single GPU (> 44 GB of VRAM); multi-GPU training is also supported

llamafactory-cli train examples/train_lora/gpt_lora_sft.yaml

3. Merge the LoRA weights into the base model

llamafactory-cli export --model_name_or_path openai/gpt-oss-20b --adapter_name_or_path saves/gpt-20b/lora/sft --export_dir gpt_merged
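If you prefer to do the merge in Python rather than via the CLI, here is a rough sketch using transformers and PEFT. The paths mirror the command above; loading the base model dequantized to BF16 is an assumption of this sketch, not necessarily what the export command does internally.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load the base model (dequantized to BF16), attach the LoRA adapter, merge, and save.
base = AutoModelForCausalLM.from_pretrained("openai/gpt-oss-20b", torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, "saves/gpt-20b/lora/sft")
merged = model.merge_and_unload()
merged.save_pretrained("gpt_merged")

# Save the tokenizer alongside the merged weights so the exported folder is self-contained.
AutoTokenizer.from_pretrained("openai/gpt-oss-20b").save_pretrained("gpt_merged")
```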

(Optional) Chat with the fine-tuned model

llamafactory-cli chat --model_name_or_path gpt_merged --template gpt --skip_special_tokens False
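Alternatively, a minimal transformers sketch for chatting with the merged checkpoint, assuming the tokenizer saved in gpt_merged carries the gpt-oss chat template:

```python
from transformers import pipeline

# The text-generation pipeline applies the tokenizer's chat template to the messages.
pipe = pipeline("text-generation", model="gpt_merged", torch_dtype="auto", device_map="auto")
messages = [{"role": "user", "content": "Briefly explain what LoRA fine-tuning does."}]
result = pipe(messages, max_new_tokens=256)
print(result[0]["generated_text"][-1]["content"])  # the last message is the assistant reply
```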

Full fine-tuning recipes

See #8837


Use the Web UI to fine-tune the model:

@hiyouga hiyouga added the "solved" label Aug 5, 2025
@hiyouga hiyouga merged commit 706b3e5 into main Aug 5, 2025
16 checks passed
@hiyouga hiyouga deleted the hiyouga/gpt branch August 5, 2025 21:56
@yuimo yuimo commented Aug 6, 2025

What is the weight precision after fine-tuning? Is it still MXFP4 for the MoE layers?

@ziheng0924

When will full-parameter fine-tuning be supported?

@hiyouga hiyouga (Owner, Author) commented Aug 6, 2025

@ziheng0924 Full fine-tuning is supported.

@hiyouga hiyouga (Owner, Author) commented Aug 6, 2025

@yuimo The LoRA weights will be in FP32 format.

@PROoshio PROoshio commented Aug 6, 2025

When will vLLM inference be supported?

@BenjaminBossan

Hi, I just saw this PR to support gpt-oss. IIUC, with this recipe, only the nn.Linear layers are being targeted. To target the MoE layers, you'd have to use a new PEFT feature, namely LoraConfig(target_parameters=[...]). The OpenAI cookbook has an example of that.

Generally, targeting just the nn.Linear layers may be fine, but the majority of parameters reside in the MoE layers (90% for the 20b model), so users may want to target those too. On the other hand, this will be much more expensive memory-wise, so there is a trade-off here. If you have questions about the new PEFT feature, LMK.
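For reference, a minimal sketch of such a config, assuming a PEFT release that supports target_parameters; the layer indices and parameter names below follow the OpenAI cookbook example and are illustrative, not taken from this PR:

```python
from peft import LoraConfig

# Target the regular nn.Linear layers via target_modules, and additionally the MoE
# expert weights (plain nn.Parameter tensors) via the new target_parameters option.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules="all-linear",
    target_parameters=[
        "7.mlp.experts.gate_up_proj",
        "7.mlp.experts.down_proj",
        "15.mlp.experts.gate_up_proj",
        "15.mlp.experts.down_proj",
    ],
)
```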

@hiyouga hiyouga (Owner, Author) commented Aug 6, 2025

@BenjaminBossan Sure, I agree with you. I'm excited to explore these new PEFT features. Thank you for pointing it out.

@RalphMao RalphMao commented Aug 6, 2025

> When will vLLM inference be supported?

vLLM doesn't support the BF16 checkpoint yet:

vllm-project/vllm#22380

@Pikachu1412

[screenshot] CPU memory grows abnormally when loading the gpt-oss-120b model, leading to an OOM.

@KosmoCHE KosmoCHE commented Aug 7, 2025

> What is the weight precision after fine-tuning? Is it still MXFP4 for the MoE layers?

It will be BF16 if Triton < 3.4.0.

@Opdoop Opdoop commented Aug 7, 2025

After fine-tuning the gpt-oss model, I have BF16 weights. Is there any tool to create MXFP4 weights from the BF16 weights?

@liuqianchao

+1, is there any method to convert a trained BF16 SFT model to MXFP4, or to train directly in native MXFP4?

kahlun pushed a commit to DataInsightAutomation/LLaMA-Factory that referenced this pull request Aug 8, 2025
@bobzhang208 bobzhang208 commented Aug 9, 2025

Roughly how much GPU memory does fine-tuning gpt-oss-20b use? It takes about 50 GB on an A100, so why does it OOM on 4x RTX 3090 (24 GB)?

@WeiminWu2000

Fine-tuning gpt-oss-20b does not seem to support Liger Kernel; with a longer input context length, memory blows up. Is there a workaround to enable Liger Kernel?

@dragon18456 dragon18456 mentioned this pull request Aug 21, 2025
@Imbernoulli

What is the minimum number of GPUs needed for full-parameter fine-tuning of gpt-oss-120b?

@Lei-Tin Lei-Tin commented Sep 3, 2025

Does this allow us to train with the reasoning content within the prompt? Or do we have to process the prompt to allow the gpt template to take in reasoning content as well?
