FIRST: Faster Improved Listwise Reranking with Single Token Decoding

Revanth Gangi Reddy; JaeHyeok Doo; Yifei Xu; Md Arafat Sultan; Deevya; Swain; Avirup Sil; Heng Ji

arXiv:2406.15657·cs.IR·June 25, 2024

FIRST: Faster Improved Listwise Reranking with Single Token Decoding

Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya, Swain, Avirup Sil, Heng Ji

PDF

Open Access 1 Repo 2 Models 1 Datasets

TL;DR

The paper introduces FIRST, a more efficient listwise LLM reranking method that speeds up inference by 50% and improves ranking accuracy, demonstrating practical benefits in retrieval tasks and relevance feedback.

Contribution

FIRST leverages the first generated token logits for faster reranking and incorporates a learning-to-rank loss, advancing listwise LLM reranking efficiency and effectiveness.

Findings

01

FIRST accelerates inference by 50%.

02

Maintains robust ranking performance across benchmarks.

03

Improves retriever recall with relevance feedback.

Abstract

Large Language Models (LLMs) have significantly advanced the field of information retrieval, particularly for reranking. Listwise LLM rerankers have showcased superior performance and generalizability compared to existing supervised approaches. However, conventional listwise LLM reranking methods lack efficiency as they provide ranking output in the form of a generated ordered sequence of candidate passage identifiers. Further, they are trained with the typical language modeling objective, which treats all ranking errors uniformly--potentially at the cost of misranking highly relevant passages. Addressing these limitations, we introduce FIRST, a novel listwise LLM reranking approach leveraging the output logits of the first generated identifier to directly obtain a ranked ordering of the candidates. Further, we incorporate a learning-to-rank loss during training, prioritizing ranking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gangiswag/llm-reranker
jaxOfficial

Models

Datasets

quicktensor/blockrank-beir-evals
dataset· 45 dl
45 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsError Correcting Code Techniques · Cooperative Communication and Network Coding · Coding theory and cryptography