Large Language Models are Zero-Shot Rankers for Recommender Systems
Yupeng Hou, Junjie Zhang, Zihan Lin, Hongyu Lu, Ruobing Xie, Julian, McAuley, Wayne Xin Zhao

TL;DR
This paper explores the use of large language models as zero-shot rankers in recommender systems, revealing their potential and limitations, and proposing strategies to improve their ranking performance.
Contribution
It formalizes recommendation as a conditional ranking task for LLMs and introduces prompting and bootstrapping methods to enhance zero-shot ranking capabilities.
Findings
LLMs show promising zero-shot ranking abilities.
They struggle with perceiving the order of interactions and are biased by popularity.
Prompting strategies can mitigate these issues.
Abstract
Recently, large language models (LLMs) (e.g., GPT-4) have demonstrated impressive general-purpose task-solving abilities, including the potential to approach recommendation tasks. Along this line of research, this work aims to investigate the capacity of LLMs that act as the ranking model for recommender systems. We first formalize the recommendation problem as a conditional ranking task, considering sequential interaction histories as conditions and the items retrieved by other candidate generation models as candidates. To solve the ranking task by LLMs, we carefully design the prompting template and conduct extensive experiments on two widely-used datasets. We show that LLMs have promising zero-shot ranking abilities but (1) struggle to perceive the order of historical interactions, and (2) can be biased by popularity or item positions in the prompts. We demonstrate that these issues…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Multimodal Machine Learning Applications
