Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
Alex Laitenberger, Christopher D. Manning, Nelson F. Liu

TL;DR
This paper evaluates retrieval-augmented generation methods with long-context language models and finds that simple, structure-preserving baselines like DOS RAG often outperform more complex pipelines in long-context QA tasks.
Contribution
It demonstrates that a straightforward retrieve-then-read approach preserving document structure can be as effective or better than complex multi-stage RAG pipelines for long-context QA.
Findings
DOS RAG matches or outperforms complex methods on long-context QA benchmarks
Maintaining source fidelity and document structure improves retrieval effectiveness
Simplicity in retrieval pipelines can be more beneficial than added complexity
Abstract
With the rise of long-context language models (LMs) capable of processing tens of thousands of tokens in a single context window, do multi-stage retrieval-augmented generation (RAG) pipelines still offer measurable benefits over simpler, single-stage approaches? To assess this question, we conduct a controlled evaluation for QA tasks under systematically scaled token budgets, comparing two recent multi-stage pipelines, ReadAgent and RAPTOR, against three baselines, including DOS RAG (Document's Original Structure RAG), a simple retrieve-then-read method that preserves original passage order. Despite its straightforward design, DOS RAG consistently matches or outperforms more intricate methods on multiple long-context QA benchmarks. We trace this strength to a combination of maintaining source fidelity and document structure, prioritizing recall within effective context windows, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Natural Language Processing Techniques
