ECoRAG: Evidentiality-guided Compression for Long Context RAG

Yeonseok Jeong; Jinsu Kim; Dohyeon Lee; Seung-won Hwang

arXiv:2506.05167·cs.CL·June 9, 2025

ECoRAG: Evidentiality-guided Compression for Long Context RAG

Yeonseok Jeong, Jinsu Kim, Dohyeon Lee, Seung-won Hwang

PDF

Open Access 1 Repo

TL;DR

ECoRAG introduces an evidentiality-guided document compression method for RAG that enhances LLM performance in ODQA by filtering relevant evidence, reducing latency, and minimizing token usage.

Contribution

The paper presents a novel evidentiality-guided compression framework for RAG that selectively retains evidence-supporting information, improving efficiency and accuracy.

Findings

01

Outperforms existing compression methods in ODQA tasks.

02

Reduces latency and token usage significantly.

03

Enhances LLM accuracy by focusing on evidence-supported content.

Abstract

Large Language Models (LLMs) have shown remarkable performance in Open-Domain Question Answering (ODQA) by leveraging external documents through Retrieval-Augmented Generation (RAG). To reduce RAG overhead, from longer context, context compression is necessary. However, prior compression methods do not focus on filtering out non-evidential information, which limit the performance in LLM-based RAG. We thus propose Evidentiality-guided RAG, or ECoRAG framework. ECoRAG improves LLM performance by compressing retrieved documents based on evidentiality, ensuring whether answer generation is supported by the correct evidence. As an additional step, ECoRAG reflects whether the compressed content provides sufficient evidence, and if not, retrieves more until sufficient. Experiments show that ECoRAG improves LLM performance on ODQA tasks, outperforming existing compression methods. Furthermore,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ldilab/ecorag
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Information Retrieval and Search Behavior

MethodsLayer Normalization · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Byte Pair Encoding · Softmax · Linear Layer · Dropout · Dense Connections · Attention Is All You Need