Found in the Middle: Permutation Self-Consistency Improves Listwise   Ranking in Large Language Models

Raphael Tang; Xinyu Zhang; Xueguang Ma; Jimmy Lin; Ferhan Ture

arXiv:2310.07712·cs.CL·April 23, 2024·6 cites

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models

Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces permutation self-consistency, a novel method that reduces positional bias in large language models for listwise ranking by marginalizing over prompt orderings, leading to improved ranking accuracy.

Contribution

The paper proposes permutation self-consistency, a new approach that enhances listwise ranking in LLMs by marginalizing prompt order biases, with theoretical guarantees and empirical improvements.

Findings

01

Improves ranking scores by up to 18% on GPT-3.5.

02

Surpasses previous state-of-the-art in passage reranking.

03

Provides theoretical proof of robustness and convergence.

Abstract

Large language models (LLMs) exhibit positional bias in how they use context, which especially complicates listwise ranking. To address this, we propose permutation self-consistency, a form of self-consistency over ranking list outputs of black-box LLMs. Our key idea is to marginalize out different list orders in the prompt to produce an order-independent ranking with less positional bias. First, given some input prompt, we repeatedly shuffle the list in the prompt and pass it through the LLM while holding the instructions the same. Next, we aggregate the resulting sample of rankings by computing the central ranking closest in distance to all of them, marginalizing out prompt order biases in the process. Theoretically, we prove the robustness of our method, showing convergence to the true ranking in the presence of random perturbations. Empirically, on five list-ranking datasets in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

castorini/perm-sc
noneOfficial

Videos

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Layer Normalization · Attention Dropout · Dense Connections · Dropout