Commit 5758a0a

Merge pull request #167 from rllm-org/dev-sijun
update docs
2 parents cdebb89 + 2b4139b commit 5758a0a

11 files changed: +18 -18 lines changed

README.md

Lines changed: 4 additions & 4 deletions
@@ -15,7 +15,7 @@
 [![Discord](https://img.shields.io/badge/Discord-5865F2?style=for-the-badge&logo=discord&logoColor=white)](https://discord.gg/BDH46HT9en)
 [![Website](https://img.shields.io/badge/Site-%23000000.svg?style=for-the-badge&logo=semanticweb&logoColor=white)](https://www.agentica-project.com)
 [![Twitter/X](https://img.shields.io/badge/Agentica-white?style=for-the-badge&logo=X&logoColor=000&color=000&labelColor=white)](https://x.com/Agentica_)
-[![Github](https://img.shields.io/badge/RLLM-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/agentica-project/rllm)
+[![Github](https://img.shields.io/badge/RLLM-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/rllm-org/rllm)
 [![Hugging Face Collection](https://img.shields.io/badge/Agentica-fcd022?style=for-the-badge&logo=huggingface&logoColor=000&labelColor)](https://huggingface.co/agentica-org)

 </div>
@@ -32,15 +32,15 @@ rLLM is an open-source framework for post-training language agents via reinforce
 - 🍽️ An In-Depth Blog Post on our [SWE Agents and RL Training Recipes](https://pretty-radio-b75.notion.site/DeepSWE-Training-a-Fully-Open-sourced-State-of-the-Art[…]-by-Scaling-RL-22281902c1468193aabbe9a8c59bbe33?pvs=73)
 - 🤗 HF Model [`DeepSWE-Preview`](https://huggingface.co/agentica-org/DeepSWE-Preview)
 - 🤗 HF Dataset [`R2E-Gym-Subset`](https://huggingface.co/datasets/R2E-Gym/R2E-Gym-Subset)
-- 📄 [Training Scripts](https://github.com/agentica-project/rllm/tree/main/examples/swe)
+- 📄 [Training Scripts](https://github.com/rllm-org/rllm/tree/main/examples/swe)
 - 📈 [Wandb Training Logs](https://wandb.ai/mluo/deepswe)—All training runs and ablations.
 - 🔎 [Evaluation Logs](https://drive.google.com/file/d/10LIwpJeaFuiX6Y-qEG2a4a335PEuQJeS/view?usp=sharing)—16 passes over SWE-Bench-Verified.

 <strong>[2025/04/08]</strong> We release [`DeepCoder-14B-Preview`](https://pretty-radio-b75.notion.site/DeepCoder-A-Fully-Open-Source-14B-Coder-at-O3-mini-Level-1cf81902c14680b3bee5eb349a512a51), a 14B coding model that achieves an impressive **60.6%** Pass@1 accuracy on LiveCodeBench (+8% improvement), matching the performance of `o3-mini-2025-01-031 (Low)` and `o1-2024-12-17`.
 - ⬆️ An In-Depth Blog Post on our [Training Recipe and Insights](https://pretty-radio-b75.notion.site/DeepCoder-A-Fully-Open-Source-14B-Coder-at-O3-mini-Level-1cf81902c14680b3bee5eb349a512a51)
 - 🤗 HF Model [`DeepCoder-14B-Preview`](https://huggingface.co/agentica-org/DeepCoder-14B-Preview), [`DeepCoder-1.5B-Preview`](https://huggingface.co/agentica-org/DeepCoder-1.5B-Preview)
 - 🤗 HF Dataset [`DeepCoder-Preview-Dataset`](https://huggingface.co/datasets/agentica-org/DeepCoder-Preview-Dataset)
-- 📄 [Training Scripts](https://github.com/agentica-project/rllm/tree/main/scripts/deepcoder/train)—Exact hyperparameters we used to achieve `o3-mini` performance.
+- 📄 [Training Scripts](https://github.com/rllm-org/rllm/tree/main/scripts/deepcoder/train)—Exact hyperparameters we used to achieve `o3-mini` performance.
 - 📈 [Wandb Training Logs](https://wandb.ai/mluo/deepcoder)—All training runs and ablations.
 - 🔎 [Evaluation Logs](https://drive.google.com/file/d/1tr_xXvCJnjU0tLO7DNtFL85GIr3aGYln/view?usp=sharing)—LiveCodeBench and Codeforces logs for DeepCoder.

@@ -59,7 +59,7 @@ rLLM is an open-source framework for post-training language agents via reinforce

 ```bash
 # Clone the repository
-git clone --recurse-submodules https://github.com/agentica-project/rllm.git
+git clone --recurse-submodules https://github.com/rllm-org/rllm.git
 cd rllm

 # create a conda environment

docs/examples/deepcoder.md

Lines changed: 1 addition & 1 deletion
@@ -81,4 +81,4 @@ DeepCoder training configuration:
 --8<-- "examples/deepcoder/train_deepcoder.py"
 ```

-For detailed setup instructions, see the [README](https://github.com/agentica-project/rllm/blob/main/examples/deepcoder/README.md) in the deepcoder example directory.
+For detailed setup instructions, see the [README](https://github.com/rllm-org/rllm/blob/main/examples/deepcoder/README.md) in the deepcoder example directory.

docs/examples/deepscaler.md

Lines changed: 1 addition & 1 deletion
@@ -83,4 +83,4 @@ DeepScaler training configuration:
 --8<-- "examples/deepscaler/train_deepscaler.py"
 ```

-For detailed setup instructions, see the [README](https://github.com/agentica-project/rllm/blob/main/examples/deepscaler/README.md) in the deepscaler example directory.
+For detailed setup instructions, see the [README](https://github.com/rllm-org/rllm/blob/main/examples/deepscaler/README.md) in the deepscaler example directory.

docs/examples/frozenlake.md

Lines changed: 1 addition & 1 deletion
@@ -48,4 +48,4 @@ Agent training implementation:
 --8<-- "examples/frozenlake/train_frozenlake_agent.py"
 ```

-For more details, see the [FrozenLake README](https://github.com/agentica-project/rllm/blob/main/examples/frozenlake/README.md).
+For more details, see the [FrozenLake README](https://github.com/rllm-org/rllm/blob/main/examples/frozenlake/README.md).

docs/examples/search.md

Lines changed: 1 addition & 1 deletion
@@ -53,4 +53,4 @@ Search agent training configuration:
 --8<-- "examples/search/train_search_agent.py"
 ```

-For detailed setup instructions, see the [README](https://github.com/agentica-project/rllm/blob/main/examples/search/README.md) in the search example directory.
+For detailed setup instructions, see the [README](https://github.com/rllm-org/rllm/blob/main/examples/search/README.md) in the search example directory.

docs/examples/sft.md

Lines changed: 1 addition & 1 deletion
@@ -127,4 +127,4 @@ Script for evaluating SFT model performance:
 --8<-- "examples/sft/run_sft_model.py"
 ```

-For detailed setup instructions, see the [README](https://github.com/agentica-project/rllm-internal/blob/v0.1/examples/sft/README.md) in the sft example directory.
+For detailed setup instructions, see the [README](https://github.com/rllm-org/rllm/blob/main/examples/sft/README.md) in the sft example directory.

docs/examples/swe.md

Lines changed: 1 addition & 1 deletion
@@ -75,4 +75,4 @@ DeepSWE training configuration:
 --8<-- "examples/swe/train_deepswe_agent.py"
 ```

-For detailed setup instructions, see the [README](https://github.com/agentica-project/rllm/blob/main/examples/swe/README.md) in the deepswe example directory.
+For detailed setup instructions, see the [README](https://github.com/rllm-org/rllm/blob/main/examples/swe/README.md) in the deepswe example directory.

docs/getting-started/installation.md

Lines changed: 2 additions & 2 deletions
@@ -24,7 +24,7 @@ rLLM uses [verl](https://github.com/volcengine/verl) as its training backend. Fo

 ```bash
 # Clone the repository
-git clone --recurse-submodules https://github.com/agentica-project/rllm.git
+git clone --recurse-submodules https://github.com/rllm-org/rllm.git
 cd rllm

 # create a conda environment
@@ -38,4 +38,4 @@ pip install -e .

 This will install rLLM and all its dependencies in development mode.

-For more help, refer to the [GitHub issues page](https://github.com/agentica-project/rllm/issues).
+For more help, refer to the [GitHub issues page](https://github.com/rllm-org/rllm/issues).

docs/index.md

Lines changed: 1 addition & 1 deletion
@@ -26,7 +26,7 @@ rLLM currently supports a variety of built-in agents:
 - **Frozenlake Agent**: Train agents to navigate text-based grid world. (useful for testing/debugging RL algorithms)

 ## 🛠️ Train Your Own Agents & Environments
-rLLM is designed to be extensible. You can easily build and train your own custom agents and environments using our modular API and training engine. Walk through our [core concepts](./core-concepts/overview.md) and [examples](https://github.com/agentica-project/rllm/tree/main/examples) to understand the fundamentals of rLLM and build your own custom agents and environments tailored to your specific use cases.
+rLLM is designed to be extensible. You can easily build and train your own custom agents and environments using our modular API and training engine. Walk through our [core concepts](./core-concepts/overview.md) and [examples](https://github.com/rllm-org/rllm/tree/main/examples) to understand the fundamentals of rLLM and build your own custom agents and environments tailored to your specific use cases.

 ## 🚀Future Roadmap

examples/agents/swe/README.md

Lines changed: 2 additions & 2 deletions
@@ -13,12 +13,12 @@

 <a href="https://agentica-project.com/" > 🌐 Project Page</a>

-<a href="https://github.com/agentica-project/rllm" > 🧑‍💻 Code</a>
+<a href="https://github.com/rllm-org/rllm" > 🧑‍💻 Code</a>
 </p>

 <div align="center">

-[![Github](https://img.shields.io/badge/RLLM-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/agentica-project/rllm)
+[![Github](https://img.shields.io/badge/RLLM-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/rllm-org/rllm)
 [![Website](https://img.shields.io/badge/Site-%23000000.svg?style=for-the-badge&logo=semanticweb&logoColor=white)](https://www.agentica-project.com)
 [![Twitter](https://img.shields.io/badge/Agentica-white?style=for-the-badge&logo=X&logoColor=000&color=000&labelColor=white)](https://x.com/Agentica_)
 [![Hugging Face Collection](https://img.shields.io/badge/Agentica-fcd022?style=for-the-badge&logo=huggingface&logoColor=000&labelColor)](https://huggingface.co/agentica-org)
