R-Search

The Large Language Model (LLM) should output in the following format:

Reasoning process + Search planning results - DAG + [Search results] + Final generated result

Installation

trl: Install from GitHub:

pip install git+https://github.com/huggingface/trl.git

Data clustering uses data/clusters.py, and dataset generation and filtering use data/qa_gen.py.

To train the model, run the following script:

sh train.sh

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
RL		RL
data		data
lagent		lagent
README.md		README.md
ds_stage2.json		ds_stage2.json
local_ds8.yaml		local_ds8.yaml
train.sh		train.sh