Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models
Qi Liu, Bo Wang, Nan Wang, Jiaxin Mao

TL;DR
PE-Rank enhances passage reranking efficiency by using passage embeddings as context, reducing input length and decoding time, while maintaining competitive effectiveness in large language model-based listwise reranking.
Contribution
The paper introduces PE-Rank, a novel method that leverages passage embeddings for efficient listwise reranking with LLMs, addressing input length and latency issues.
Findings
Significant efficiency improvements in reranking speed.
Maintains competitive ranking effectiveness.
Applicable across multiple benchmark datasets.
Abstract
Recent studies have demonstrated the effectiveness of using large language language models (LLMs) in passage ranking. The listwise approaches, such as RankGPT, have become new state-of-the-art in this task. However, the efficiency of RankGPT models is limited by the maximum context length and relatively high latency of LLM inference. To address these issues, in this paper, we propose PE-Rank, leveraging the single passage embedding as a good context compression for efficient listwise passage reranking. By treating each passage as a special token, we can directly input passage embeddings into LLMs, thereby reducing input length. Additionally, we introduce an inference method that dynamically constrains the decoding space to these special tokens, accelerating the decoding process. For adapting the model to reranking, we employ listwise learning to rank loss for training. Evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
