An Investigation of Prompt Variations for Zero-shot LLM-based Rankers

Shuoqi Sun; Shengyao Zhuang; Shuai Wang; Guido Zuccon

arXiv:2406.14117 · cs.IR · July 28, 2025 · 1 citation

TL;DR

This paper systematically studies how prompt components and wording variations influence zero-shot LLM-based rankers, revealing that prompt design can significantly impact effectiveness, sometimes more than the underlying ranking algorithm or LLM choice.

Contribution

It provides a large-scale analysis demonstrating that prompt components and wording choices critically affect zero-shot LLM ranking performance, often surpassing algorithm and backbone differences.

Findings

01. Prompt wording significantly impacts ranking effectiveness.

02. Prompt design can outweigh algorithm and LLM differences.

03. Prompt variations can blur distinctions between different ranking methods.

Abstract

We provide a systematic understanding of the impact of specific components and wordings used in prompts on the effectiveness of rankers based on zero-shot Large Language Models (LLMs). Several zero-shot ranking methods based on LLMs have recently been proposed. Among other aspects, these methods differ in (1) the ranking algorithm they implement, e.g., pointwise vs. listwise, (2) the backbone LLMs used, e.g., GPT3.5 vs. FLAN-T5, and (3) the components and wording used in prompts, e.g., whether a role definition (role-playing) is used and the actual words used to express it. It is currently unclear whether performance differences are due to the underlying ranking algorithm or to spurious factors such as a better choice of words in prompts. This confusion risks undermining future research. Through our large-scale experimentation and analysis, we find that ranking algorithms do…
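To make the notion of prompt components and wording concrete, the following is a minimal sketch of how variants of a pointwise ranking prompt might be enumerated. It is an illustration only, not the authors' code: the wording alternatives, the prompt layout, and the helper names `build_pointwise_prompt` and `prompt_variants` are all assumptions made for this example.

```python
from itertools import product

# Hypothetical wording alternatives, invented for illustration
# (they are not the prompts studied in the paper).
ROLE_DEFINITIONS = [
    "",  # no role definition
    "You are a search engine that judges relevance.",
]
TASK_INSTRUCTIONS = [
    "Does the passage answer the query?",
    "Is the passage relevant to the query?",
]


def build_pointwise_prompt(role, instruction, query, passage):
    """Assemble one pointwise prompt; an empty component is skipped."""
    parts = [role, f"Query: {query}", f"Passage: {passage}", instruction]
    return "\n".join(part for part in parts if part)


def prompt_variants(query, passage):
    """Return one prompt per combination of role definition and instruction."""
    return [
        build_pointwise_prompt(role, instruction, query, passage)
        for role, instruction in product(ROLE_DEFINITIONS, TASK_INSTRUCTIONS)
    ]


if __name__ == "__main__":
    for prompt in prompt_variants("what is bm25", "BM25 is a ranking function."):
        print(prompt)
        print("---")
```

Each variant would be sent to the same backbone LLM with the same ranking algorithm, so that any change in effectiveness can be attributed to the prompt rather than to the method. A listwise variant would differ only in how the candidate passages are laid out in the prompt.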

Code & Models

Repositories

ielab/zeroshot-rankers-prompt-variations
PyTorch · Official

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis