RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation
Zeyuan Meng, Zixuan Yi, Iadh Ounis

TL;DR
This paper evaluates the RecRanker LLM-based recommendation framework, reproduces its results, identifies data leakage issues, and introduces RecRankerEval, a comprehensive evaluation framework for top-k recommendation models.
Contribution
It provides a reproducibility study of RecRanker, analyzes the impact of its components, and proposes RecRankerEval, an extensible framework for fair evaluation of LLM-based recommendations.
Findings
Reproduction confirms original performance for pairwise and listwise methods.
Data leakage was identified in the pointwise method due to ground-truth prompts.
Performance can be improved with alternative sampling, stronger models, and better LLMs.
Abstract
A recent Large language model (LLM)-based recommendation model, called RecRanker, has demonstrated a superior performance in the top-k recommendation task compared to other models. In particular, RecRanker samples users via clustering, generates an initial ranking list using an initial recommendation model, and fine-tunes an LLM through hybrid instruction tuning to infer user preferences. However, the contribution of each core component remains underexplored. In this work, we inspect the reproducibility of RecRanker, and study the impact and role of its various components. We begin by reproducing the RecRanker pipeline through the implementation of all its key components. Our reproduction shows that the pairwise and listwise methods achieve a performance comparable to that reported in the original paper. For the pointwise method, while we are also able to reproduce the original paper's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Graph Neural Networks · Big Data and Digital Economy
