Efficient Long Context Language Model Retrieval with Compression

Minju Seo; Jinheon Baek; Seongyun Lee; Sung Ju Hwang

arXiv:2412.18232·cs.IR·May 29, 2025

Efficient Long Context Language Model Retrieval with Compression

Minju Seo, Jinheon Baek, Seongyun Lee, Sung Ju Hwang

PDF

Open Access 1 Models 1 Video

TL;DR

This paper introduces CoLoR, a passage compression method for Long Context Language Models that enhances retrieval performance by nearly 6% while reducing in-context passage size by almost half, making IR more efficient.

Contribution

The paper presents a novel passage compression approach tailored for LCLM retrieval, trained with synthetic data and preference optimization to improve efficiency and effectiveness.

Findings

01

Improves retrieval performance by 6% on 9 datasets.

02

Reduces in-context passage size by a factor of 1.91.

03

Demonstrates effectiveness of compression in LCLM retrieval.

Abstract

Long Context Language Models (LCLMs) have emerged as a new paradigm to perform Information Retrieval (IR), which enables the direct ingestion and retrieval of information by processing an entire corpus in their single context, showcasing the potential to surpass traditional sparse and dense retrieval methods. However, processing a large number of passages within in-context for retrieval is computationally expensive, and handling their representations during inference further exacerbates the processing time; thus, we aim to make LCLM retrieval more efficient and potentially more effective with passage compression. Specifically, we propose a new compression approach tailored for LCLM retrieval, which is trained to maximize the retrieval performance while minimizing the length of the compressed passages. To accomplish this, we generate the synthetic data, where compressed passages are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
iaminju/CoLoR-Phi-3-mini-4k-instruct
model· 9 dl· ♡ 1
9 dl♡ 1

Videos

Efficient Long Context Language Model Retrieval with Compression· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling