Evaluating Position Bias in Large Language Model Recommendations
Ethan Bito, Yongli Ren, Estrid He

TL;DR
This paper investigates position bias in large language model recommendations, revealing systemic biases and proposing a new prompting method, RISE, to mitigate bias and improve recommendation stability without fine-tuning.
Contribution
The paper identifies position bias in LLM-based recommendations and introduces RISE, a prompting strategy that reduces bias and enhances stability without model fine-tuning.
Findings
LLMs exhibit high sensitivity to input order in recommendations.
RISE significantly reduces position bias in LLM recommendations.
Proposed method improves recommendation stability on benchmark datasets.
Abstract
Large Language Models (LLMs) are being increasingly explored as general-purpose tools for recommendation tasks, enabling zero-shot and instruction-following capabilities without the need for task-specific training. While the research community is enthusiastically embracing LLMs, there are important caveats to directly adapting them for recommendation tasks. In this paper, we show that LLM-based recommendation models suffer from position bias, where the order of candidate items in a prompt can disproportionately influence the recommendations produced by LLMs. First, we analyse the position bias of LLM-based recommendations on real-world datasets, where results uncover systemic biases of LLMs with high sensitivity to input orders. Furthermore, we introduce a new prompting strategy to mitigate the position bias of LLM recommendation models called Ranking via Iterative SElection (RISE). We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Advanced Graph Neural Networks
