Batched Self-Consistency Improves LLM Relevance Assessment and Ranking
Anton Korikov, Pan Du, Scott Sanner, Navid Rekabsaz

TL;DR
This paper introduces batched pointwise relevance assessment methods for LLMs, demonstrating they improve ranking quality and efficiency by enabling joint evaluation and better leveraging self-consistency ensembling, especially in large-scale retrieval tasks.
Contribution
It proposes batched pointwise strategies that enhance LLM relevance assessment and ranking by enabling joint passage evaluation and improved self-consistency benefits, reducing latency.
Findings
Batched methods outperform one-by-one approaches in relevance ranking.
Self-consistency benefits are amplified with batching, improving performance.
Latency is significantly reduced with batched approaches.
Abstract
LLM query-passage relevance assessment is typically studied using a one-by-one pointwise (PW) strategy where each LLM call judges one passage at a time. However, this strategy requires as many LLM calls as there are passages while also preventing information sharing between passages. We thus hypothesize that batched PW methods, which evaluate multiple passages per LLM call, can improve not only efficiency but also judgment quality -- by enabling content from multiple passages to be seen jointly. Moreover, batched PW methods may be better suited to harness the test-time scaling benefits of self-consistency -- the ensembling technique of repeating (potentially perturbed) LLM tasks in parallel and aggregating results -- since batching can naturally enable prompt diversification through varied batch permutations and compositions to create more robust ensembles. We evaluate several batched…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Natural Language Processing Techniques
