Rethinking Retrieval-Augmentation as Synthesis: A Query-Aware Context Merging Approach
Jiarui Guo, Yuemeng Xu, Zongwei Lv, Yangyujia Wang, Xiaolin Wang, Kan Liu, Tao Lan, Lin Qu, Tong Yang

TL;DR
This paper introduces MergeRAG, a novel retrieval-augmentation framework that dynamically synthesizes retrieved contexts to improve information density and model performance within the limited context window of LLMs.
Contribution
MergeRAG shifts retrieval-augmentation from static filtering to query-aware synthesis using a dual-pathway merging strategy and hierarchical parallel merging, significantly enhancing retrieval effectiveness.
Findings
Achieves up to 13.7 points improvement in F1 score.
Achieves up to 11.5 points improvement in Exact Match.
Outperforms state-of-the-art RAG baselines on standard benchmarks.
Abstract
Retrieval-Augmented Generation (RAG) enables Large Language Models (LLMs) to extend their existing knowledge by dynamically incorporating external information. However, practical deployment is fundamentally constrained by the LLM's finite context window, forcing a trade-off between information sufficiency and token consumption. Standard pipelines address this via a retrieve-then-select strategy, typically retaining only the top-k chunks based on relevance. Nevertheless, this approach is suboptimal: it inherently truncates critical bridging evidence located in the long tail of the relevance distribution, while simultaneously wasting the token budget on semantically redundant high-ranking chunks. In this paper, we rethink retrieval-augmentation as a dynamic optimization problem aimed at maximizing information density. We propose MergeRAG, a novel framework that shifts the paradigm from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Multimodal Machine Learning Applications
