Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Peiran Zhou, Junnan Zhu, Yichen Shen, Ruoxi Yu

TL;DR
CASC is a novel framework that enhances retrieval-augmented generation in complex domains by intelligently synthesizing and compressing retrieved information, leading to more accurate and trustworthy answers.
Contribution
We introduce CASC, a context-adaptive synthesis and compression framework with a specialized module for better information processing in complex, multi-document retrieval scenarios.
Findings
CASC outperforms strong baselines on SciDocs-QA dataset.
Significantly reduces token count and cognitive load for LLMs.
Improves accuracy and trustworthiness in complex scientific QA tasks.
Abstract
Large Language Models (LLMs) excel in language tasks but are prone to hallucinations and outdated knowledge. Retrieval-Augmented Generation (RAG) mitigates these by grounding LLMs in external knowledge. However, in complex domains involving multiple, lengthy, or conflicting documents, traditional RAG suffers from information overload and inefficient synthesis, leading to inaccurate and untrustworthy answers. To address this, we propose CASC (Context-Adaptive Synthesis and Compression), a novel framework that intelligently processes retrieved contexts. CASC introduces a Context Analyzer & Synthesizer (CAS) module, powered by a fine-tuned smaller LLM, which performs key information extraction, cross-document consistency checking and conflict resolution, and question-oriented structured synthesis. This process transforms raw, scattered information into a highly condensed, structured, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
