Compressing Long Context for Enhancing RAG with AMR-based Concept   Distillation

Kaize Shi; Xueyao Sun; Qing Li; Guandong Xu

arXiv:2405.03085·cs.CL·May 7, 2024·2 cites

Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation

Kaize Shi, Xueyao Sun, Qing Li, Guandong Xu

PDF

Open Access

TL;DR

This paper introduces a novel AMR-based concept distillation method to compress retrieved documents in RAG, improving focus on vital information and enhancing question-answering performance with long contexts.

Contribution

It presents the first use of AMR for semantic-based context compression in RAG, significantly improving retrieval relevance and LLM inference accuracy.

Findings

01

Outperforms baseline methods in open-domain QA tasks.

02

Maintains robustness across various LLM backbones.

03

Effectively filters irrelevant information as document length increases.

Abstract

Large Language Models (LLMs) have made significant strides in information acquisition. However, their overreliance on potentially flawed parametric knowledge leads to hallucinations and inaccuracies, particularly when handling long-tail, domain-specific queries. Retrieval Augmented Generation (RAG) addresses this limitation by incorporating external, non-parametric knowledge. Nevertheless, the retrieved long-context documents often contain noisy, irrelevant information alongside vital knowledge, negatively diluting LLMs' attention. Inspired by the supportive role of essential concepts in individuals' reading comprehension, we propose a novel concept-based RAG framework with the Abstract Meaning Representation (AMR)-based concept distillation algorithm. The proposed algorithm compresses the cluttered raw retrieved documents into a compact set of crucial concepts distilled from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Advanced Algorithms and Applications · Text and Document Classification Technologies

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Weight Decay · Attention Dropout · Dropout · Residual Connection · Softmax · WordPiece · Linear Layer