Skip to content

Conversation

@techkang
Copy link
Collaborator

What does this PR do?

As described in title, add a doc for vllm+megatron training.

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request adds a new documentation page for best practices on training large models with Verl, using DAPO on Qwen3-235B as an example. The documentation is comprehensive and provides valuable, detailed guidance on configuring various parameters. My review focuses on ensuring clarity and minimizing potential risks for users. I've identified a couple of points that could be improved: the model name used in the example might be confusing as it doesn't seem to be public, and a dependency link points to a personal repository, which could be a supply chain risk. Addressing these would enhance the quality and trustworthiness of the documentation.

* -
- ``actor_rollout_ref.actor.megatron.use_mbridge``
- 是否启用 mbridge 格式转换
- 使用 Megatron 训练时建议开启;需配合最新版 mbridge(详见 https://github.com/ISEEKYAN/mbridge)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

The documentation for the mbridge dependency points to a personal GitHub repository (https://github.com/ISEEKYAN/mbridge). For a production-oriented framework like verl, relying on personal repositories for dependencies can be a supply chain security and maintenance risk. It is recommended to link to an official, organization-owned repository or the project's page on PyPI to ensure stability and trust.

@techkang techkang marked this pull request as draft October 31, 2025 11:12
@techkang techkang marked this pull request as ready for review October 31, 2025 12:16
@wuxibin89 wuxibin89 merged commit 2769986 into volcengine:main Nov 3, 2025
4 checks passed
NenoL2001 pushed a commit to NenoL2001/verl that referenced this pull request Nov 3, 2025
### What does this PR do?
As described in title,  add a doc for vllm+megatron training.
chenjiaoAngel added a commit to chenjiaoAngel/verl that referenced this pull request Nov 14, 2025
### What does this PR do?
As described in title,  add a doc for vllm+megatron training.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants