FIRST: Faster Improved Listwise Reranking with Single Token Decoding
Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya, Swain, Avirup Sil, Heng Ji

TL;DR
The paper introduces FIRST, a more efficient listwise LLM reranking method that speeds up inference by 50% and improves ranking accuracy, demonstrating practical benefits in retrieval tasks and relevance feedback.
Contribution
FIRST leverages the first generated token logits for faster reranking and incorporates a learning-to-rank loss, advancing listwise LLM reranking efficiency and effectiveness.
Findings
FIRST accelerates inference by 50%.
Maintains robust ranking performance across benchmarks.
Improves retriever recall with relevance feedback.
Abstract
Large Language Models (LLMs) have significantly advanced the field of information retrieval, particularly for reranking. Listwise LLM rerankers have showcased superior performance and generalizability compared to existing supervised approaches. However, conventional listwise LLM reranking methods lack efficiency as they provide ranking output in the form of a generated ordered sequence of candidate passage identifiers. Further, they are trained with the typical language modeling objective, which treats all ranking errors uniformly--potentially at the cost of misranking highly relevant passages. Addressing these limitations, we introduce FIRST, a novel listwise LLM reranking approach leveraging the output logits of the first generated identifier to directly obtain a ranked ordering of the candidates. Further, we incorporate a learning-to-rank loss during training, prioritizing ranking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsError Correcting Code Techniques · Cooperative Communication and Network Coding · Coding theory and cryptography
