RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation

Zeyuan Meng; Zixuan Yi; Iadh Ounis

arXiv:2507.05880·cs.IR·July 9, 2025

RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation

Zeyuan Meng, Zixuan Yi, Iadh Ounis

PDF

Open Access

TL;DR

This paper evaluates the RecRanker LLM-based recommendation framework, reproduces its results, identifies data leakage issues, and introduces RecRankerEval, a comprehensive evaluation framework for top-k recommendation models.

Contribution

It provides a reproducibility study of RecRanker, analyzes the impact of its components, and proposes RecRankerEval, an extensible framework for fair evaluation of LLM-based recommendations.

Findings

01

Reproduction confirms original performance for pairwise and listwise methods.

02

Data leakage was identified in the pointwise method due to ground-truth prompts.

03

Performance can be improved with alternative sampling, stronger models, and better LLMs.

Abstract

A recent Large language model (LLM)-based recommendation model, called RecRanker, has demonstrated a superior performance in the top-k recommendation task compared to other models. In particular, RecRanker samples users via clustering, generates an initial ranking list using an initial recommendation model, and fine-tunes an LLM through hybrid instruction tuning to infer user preferences. However, the contribution of each core component remains underexplored. In this work, we inspect the reproducibility of RecRanker, and study the impact and role of its various components. We begin by reproducing the RecRanker pipeline through the implementation of all its key components. Our reproduction shows that the pairwise and listwise methods achieve a performance comparable to that reported in the original paper. For the pointwise method, while we are also able to reproduce the original paper's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Graph Neural Networks · Big Data and Digital Economy