Efficient Long Context Language Model Retrieval with Compression
Minju Seo, Jinheon Baek, Seongyun Lee, Sung Ju Hwang

TL;DR
This paper introduces CoLoR, a passage compression method for Long Context Language Models that enhances retrieval performance by nearly 6% while reducing in-context passage size by almost half, making IR more efficient.
Contribution
The paper presents a novel passage compression approach tailored for LCLM retrieval, trained with synthetic data and preference optimization to improve efficiency and effectiveness.
Findings
Improves retrieval performance by 6% on 9 datasets.
Reduces in-context passage size by a factor of 1.91.
Demonstrates effectiveness of compression in LCLM retrieval.
Abstract
Long Context Language Models (LCLMs) have emerged as a new paradigm to perform Information Retrieval (IR), which enables the direct ingestion and retrieval of information by processing an entire corpus in their single context, showcasing the potential to surpass traditional sparse and dense retrieval methods. However, processing a large number of passages within in-context for retrieval is computationally expensive, and handling their representations during inference further exacerbates the processing time; thus, we aim to make LCLM retrieval more efficient and potentially more effective with passage compression. Specifically, we propose a new compression approach tailored for LCLM retrieval, which is trained to maximize the retrieval performance while minimizing the length of the compressed passages. To accomplish this, we generate the synthetic data, where compressed passages are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
