An Investigation of Prompt Variations for Zero-shot LLM-based Rankers
Shuoqi Sun, Shengyao Zhuang, Shuai Wang, Guido Zuccon

TL;DR
This paper systematically studies how prompt components and wording variations influence zero-shot LLM-based rankers, revealing that prompt design can significantly impact effectiveness, sometimes more than the underlying ranking algorithm or LLM choice.
Contribution
It provides a large-scale analysis demonstrating that prompt components and wording choices critically affect zero-shot LLM ranking performance, often surpassing algorithm and backbone differences.
Findings
Prompt wording significantly impacts ranking effectiveness.
Prompt design can outweigh algorithm and LLM differences.
Prompt variations can blur distinctions between different ranking methods.
Abstract
We provide a systematic understanding of the impact of specific components and wordings used in prompts on the effectiveness of rankers based on zero-shot Large Language Models (LLMs). Several zero-shot ranking methods based on LLMs have recently been proposed. Among many aspects, methods differ across (1) the ranking algorithm they implement, e.g., pointwise vs. listwise, (2) the backbone LLMs used, e.g., GPT3.5 vs. FLAN-T5, (3) the components and wording used in prompts, e.g., the use or not of role-definition (role-playing) and the actual words used to express this. It is currently unclear whether performance differences are due to the underlying ranking algorithm, or because of spurious factors such as better choice of words used in prompts. This confusion risks to undermine future research. Through our large-scale experimentation and analysis, we find that ranking algorithms do…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
