RecBench

Can LLMs Outshine Conventional Recommenders? A Comparative Evaluation

Installation

gh repo clone Jyonn/RecBench
cd RecBench
pip install -r requirements.txt
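
If the GitHub CLI (gh) is not available, a plain git clone works just as well:

git clone https://github.com/Jyonn/RecBench.git
cd RecBench
pip install -r requirements.txt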

📊 Supported Datasets

RecBench supports 15 datasets spanning news, books, movies, micro-videos, music, fashion, electronics, games, hotels, and restaurants:

  • 📰 MIND: Large-scale Microsoft news data for CTR prediction.
  • 📰 PENS: Personalized news recommendation dataset.
  • 📚 Goodreads: Book reviews and metadata.
  • 📚 Amazon Books: Subset of Amazon product reviews.
  • 🎥 MovieLens: Classic movie rating dataset.
  • 📺 MicroLens: Micro-video dataset with user-item interactions.
  • 📺 Netflix Prize: Large-scale movie rating competition dataset.
  • 🎵 Amazon CDs: Music CD reviews and metadata.
  • 🎵 Last.fm: Music playback logs and tagging data.
  • 👗 H&M: Apparel and fashion product data.
  • 👗 POG: Fashion outfit dataset with user interaction data.
  • 📱 Amazon Electronics: Electronics product reviews and metadata.
  • 🎮 Steam: Video game reviews and metadata.
  • 🏨 HotelRec: Hotel recommendation dataset.
  • 🍽️ Yelp: Restaurant reviews and metadata.

You can download our preprocessed data from Kaggle (recommended), Google Drive, or GitHub Releases.
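
The on-disk layout of the preprocessed files is defined by the preprocessing scripts. As a purely hypothetical sketch (the data/mind path, file name, and CSV format below are assumptions, not the documented schema), a downloaded interaction table could be inspected with pandas:

import pandas as pd

# Hypothetical path and file name: adjust to the actual preprocessed layout.
interactions = pd.read_csv('data/mind/interactions.csv')
print(interactions.columns.tolist())
print(interactions.head())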

Usage

Example 1: Zero-shot, Pair-wise

python worker.py --model llama1 --data mind
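
Zero-shot evaluation amounts to asking the LLM to judge user-item pairs in natural language. Below is a minimal sketch of what such a pair-wise prompt might look like; the wording and field names are illustrative assumptions, not RecBench's actual template:

def build_pairwise_prompt(history_titles, candidate_title):
    # Illustrative prompt; the real template used by worker.py may differ.
    history = '\n'.join(f'- {title}' for title in history_titles)
    return (
        'User browsing history:\n'
        f'{history}\n'
        f'Candidate item: {candidate_title}\n'
        'Will the user click on the candidate? Answer Yes or No.'
    )

print(build_pairwise_prompt(['Stocks rally on Fed decision', 'Local team wins title'],
                            'New foldable phone announced'))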

Example 2: Fine-tune, Pair-wise

python tuner.py --model llama1 --train mind --valid mind
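
tuner.py drives the full training loop. Conceptually, parameter-efficient fine-tuning of a LLaMA-style model looks roughly like the sketch below; whether RecBench uses LoRA specifically, along with the model name, rank, and target modules shown here, are all illustrative assumptions:

from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Illustrative hyperparameters; the actual tuner may differ.
base = AutoModelForCausalLM.from_pretrained('huggyllama/llama-7b')
config = LoraConfig(r=8, lora_alpha=16, target_modules=['q_proj', 'v_proj'],
                    task_type='CAUSAL_LM')
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the low-rank adapters are trained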

Example 3: Fine-tune, List-wise, Unique-ID-based

python seq_processor.py --data mind  # preprocess SeqRec data
python id_coder.py --data mind --seq true  # use unique identifier to represent items
python seq_tuner.py --model llama1seq --data mind --code_path ./code/mind.id.seq.code
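
The unique-identifier representation replaces each item's text with a short dedicated token, so the fine-tuned LLM can emit ranked item lists directly. A minimal sketch of such a coder follows; the <item_i> token format is an assumption, and the actual file format of ./code/mind.id.seq.code is defined by id_coder.py:

def build_id_codes(item_ids):
    # Map each raw item id to a dedicated token, e.g. 'N1234' -> '<item_0>'.
    return {item: f'<item_{i}>' for i, item in enumerate(item_ids)}

codes = build_id_codes(['N1234', 'N5678'])
print(codes)  # {'N1234': '<item_0>', 'N5678': '<item_1>'}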

Example 4: Fine-tune, List-wise, Semantic-ID-based

python embedder.py --data mind --model llama1  # extract item embeddings 
python code_generator.py --data mind --model llama1  # use RQ-VAE for discrete tokenization
python seq_tuner.py --model llama1seq --data mind --code_path ./code/mind.llama1.seq.code
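
RQ-VAE converts each continuous item embedding into a short sequence of discrete codes by repeatedly quantizing the residual against a stack of codebooks. The numpy sketch below shows only the quantization step; the codebook sizes and depth are illustrative, and the RQ-VAE trained by code_generator.py learns its codebooks end to end:

import numpy as np

def residual_quantize(embedding, codebooks):
    # At each level, pick the nearest codeword for the remaining residual.
    residual = embedding.copy()
    codes = []
    for codebook in codebooks:  # codebook shape: (num_codes, dim)
        distances = np.linalg.norm(codebook - residual, axis=1)
        index = int(np.argmin(distances))
        codes.append(index)
        residual = residual - codebook[index]
    return codes

rng = np.random.default_rng(0)
codebooks = [rng.normal(size=(256, 8)) for _ in range(3)]  # 3 levels, 256 codes each
print(residual_quantize(rng.normal(size=8), codebooks))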

More documentation will be available soon.

Updates

Citations

If you find RecBench useful in your research, please consider citing our project:

@article{liu2025benchmarking,
  title={Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders},
  author={Liu, Qijiong and Zhu, Jieming and Fan, Lu and Wang, Kun and Hu, Hengchang and Guo, Wei and Liu, Yong and Wu, Xiao-Ming},
  journal={arXiv preprint arXiv:2503.05493},
  year={2025}
}
