Commit 62cc8b9

add verl as submodule, update install instructions
1 parent 1b393de commit 62cc8b9

4 files changed: 16 additions, 6 deletions

.gitmodules

Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+[submodule "verl"]
+	path = verl
+	url = https://github.com/volcengine/verl.git
+	branch = main
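
The entry above can be inspected without cloning anything, since `git config -f` parses `.gitmodules` like any other git config file. The following is a minimal sketch, assuming a scratch directory (`tmpdir` is a hypothetical name) that holds a copy of the file from this diff:

```bash
# Hypothetical scratch directory; the file content mirrors the diff above.
tmpdir="$(mktemp -d)"
cat > "$tmpdir/.gitmodules" <<'EOF'
[submodule "verl"]
	path = verl
	url = https://github.com/volcengine/verl.git
	branch = main
EOF

# Read each field of the "verl" submodule entry.
sub_path="$(git config -f "$tmpdir/.gitmodules" --get submodule.verl.path)"
sub_url="$(git config -f "$tmpdir/.gitmodules" --get submodule.verl.url)"
sub_branch="$(git config -f "$tmpdir/.gitmodules" --get submodule.verl.branch)"
echo "$sub_path $sub_url $sub_branch"

# In a checkout that predates this commit, the submodule would be fetched with:
#   git submodule update --init --recursive
```

The same three `git config` calls should work against the real `.gitmodules` at the repository root.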

Dockerfile

Lines changed: 6 additions & 2 deletions
@@ -4,9 +4,13 @@ ENV DEBIAN_FRONTEND=noninteractive
 
 WORKDIR /workspace
 
-RUN git clone https://github.com/rllm-org/rllm.git rllm
+RUN pip uninstall verl -y || true
 
-RUN cd rllm && pip install -e .
+RUN git clone --recurse-submodules https://github.com/rllm-org/rllm.git rllm
+
+RUN cd rllm && \
+    pip install --no-deps -e ./verl && \
+    pip install -e .
 
 RUN pip install playwright && \
     playwright install chromium && \
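
The new `pip install --no-deps -e ./verl` step depends on the clone having used `--recurse-submodules`; a plain `git clone` leaves `verl/` as an empty directory. A minimal guard could look like the sketch below. `submodule_ready` is a hypothetical helper that does not exist in rllm or verl, and the demo runs on throwaway directories rather than a real checkout:

```bash
# submodule_ready is a hypothetical helper, not part of rllm or verl.
# It succeeds only if the directory exists and contains at least one entry.
submodule_ready() {
    [ -d "$1" ] && [ -n "$(ls -A "$1" 2>/dev/null)" ]
}

# Demo on throwaway directories instead of a real clone.
scratch="$(mktemp -d)"
mkdir "$scratch/empty_verl" "$scratch/full_verl"
touch "$scratch/full_verl/setup.py"

for dir in "$scratch/empty_verl" "$scratch/full_verl"; do
    if submodule_ready "$dir"; then
        echo "$(basename "$dir"): populated, safe to pip install"
    else
        echo "$(basename "$dir"): empty, run git submodule update --init --recursive"
    fi
done
```

In an actual rllm checkout the check would be `submodule_ready ./verl`, run before the two `pip install` commands.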

README.md

Lines changed: 5 additions & 4 deletions
@@ -44,15 +44,16 @@ rLLM is an open-source framework for post-training language agents via reinforce
 
 ```bash
 # Clone the repository
-git clone https://github.com/rllm-org/rllm.git
+git clone --recurse-submodules https://github.com/rllm-org/rllm.git
 cd rllm
 
 # Create a conda environment
 conda create -n rllm python=3.10
 conda activate rllm
 
-# Install verl v0.5.0
-bash scripts/install_verl.sh # (or follow the instructions at https://verl.readthedocs.io/en/v0.5.x/start/install.html)
+# Install verl
+bash verl/scripts/install_vllm_sglang_mcore.sh
+pip install --no-deps -e ./verl
 
 # Install rllm
 pip install -e .
@@ -77,7 +78,7 @@ docker exec -it rllm-container bash
 
 ## Acknowledgements
 
-- Our training experiments are powered by our heavily modified fork of [verl](https://github.com/volcengine/verl), an open-source RLHF library.
+- Our training experiments are powered by [verl](https://github.com/volcengine/verl), an open-source RLHF library.
 - Our models are trained on top of [`DeepSeek-R1-Distill-Qwen-1.5B`](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B), [`DeepSeek-R1-Distill-Qwen-14B`](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B), and [`Qwen3-32B`](https://huggingface.co/Qwen/Qwen3-32b).
 - Our work is done as part of [Berkeley Sky Computing Lab](https://skycomputing.berkeley.edu/), [Berkeley AI Research](https://bair.berkeley.edu/), and a successful collaboration with Together AI.
 

verl

Submodule verl added at 8fdc4d3
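
A submodule records a specific commit, so a recursive clone checks out `8fdc4d3` rather than whatever `main` currently points to; the `branch = main` setting only takes effect with `git submodule update --remote`. One way to confirm that a checkout sits on the expected commit is sketched below. `head_matches` is a hypothetical helper, and the demo uses a throwaway repository so it runs without network access:

```bash
# head_matches is a hypothetical helper, not part of rllm or verl.
# It succeeds only if HEAD of the repository at $1 starts with the hash prefix $2.
head_matches() {
    actual="$(git -C "$1" rev-parse HEAD 2>/dev/null)" || return 1
    case "$actual" in
        "$2"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Demo on a throwaway repository with a single empty commit.
demo="$(mktemp -d)"
git -C "$demo" init -q
git -C "$demo" -c user.name=demo -c user.email=demo@example.com \
    commit -q --allow-empty -m "initial"
demo_head="$(git -C "$demo" rev-parse --short HEAD)"

if head_matches "$demo" "$demo_head"; then
    echo "checkout is at the expected commit"
fi
```

In an rllm checkout the equivalent call would be `head_matches ./verl 8fdc4d3`; `git submodule status` reports the same information in a single command.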
