Leveraging Passage Embeddings for Efficient Listwise Reranking with   Large Language Models

Qi Liu; Bo Wang; Nan Wang; Jiaxin Mao

arXiv:2406.14848·cs.CL·January 29, 2025

Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models

Qi Liu, Bo Wang, Nan Wang, Jiaxin Mao

PDF

Open Access 1 Repo

TL;DR

PE-Rank enhances passage reranking efficiency by using passage embeddings as context, reducing input length and decoding time, while maintaining competitive effectiveness in large language model-based listwise reranking.

Contribution

The paper introduces PE-Rank, a novel method that leverages passage embeddings for efficient listwise reranking with LLMs, addressing input length and latency issues.

Findings

01

Significant efficiency improvements in reranking speed.

02

Maintains competitive ranking effectiveness.

03

Applicable across multiple benchmark datasets.

Abstract

Recent studies have demonstrated the effectiveness of using large language language models (LLMs) in passage ranking. The listwise approaches, such as RankGPT, have become new state-of-the-art in this task. However, the efficiency of RankGPT models is limited by the maximum context length and relatively high latency of LLM inference. To address these issues, in this paper, we propose PE-Rank, leveraging the single passage embedding as a good context compression for efficient listwise passage reranking. By treating each passage as a special token, we can directly input passage embeddings into LLMs, thereby reducing input length. Additionally, we introduce an inference method that dynamically constrains the decoding space to these special tokens, accelerating the decoding process. For adapting the model to reranking, we employ listwise learning to rank loss for training. Evaluation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liuqi6777/pe_rank
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems