Scaling Retrieval Augmented Generation with RAG Fusion: Lessons from an Industry Deployment
Luigi Medrano, Arush Verma, Mukul Chhabra

TL;DR
This paper evaluates retrieval fusion techniques in a real-world RAG system, finding that increased recall from fusion does not translate into better answer quality under production constraints, highlighting the need for holistic evaluation.
Contribution
It provides an empirical assessment of retrieval fusion methods in a production RAG pipeline, revealing their limited benefits under realistic constraints and emphasizing the importance of comprehensive evaluation.
Findings
Retrieval fusion increases raw recall but is neutralized after re-ranking.
Fusion variants do not outperform single-query baselines on KB-level Top-$k$ accuracy.
Fusion introduces latency without improving downstream effectiveness.
Abstract
Retrieval-Augmented Generation (RAG) systems commonly adopt retrieval fusion techniques such as multi-query retrieval and reciprocal rank fusion (RRF) to increase document recall, under the assumption that higher recall leads to better answer quality. While these methods show consistent gains in isolated retrieval benchmarks, their effectiveness under realistic production constraints remains underexplored. In this work, we evaluate retrieval fusion in a production-style RAG pipeline operating over an enterprise knowledge base, with fixed retrieval depth, re-ranking budgets, and latency constraints. Across multiple fusion configurations, we find that retrieval fusion does increase raw recall, but these gains are largely neutralized after re-ranking and truncation. In our setting, fusion variants fail to outperform single-query baselines on KB-level Top- accuracy, with Hit@10…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Topic Modeling · Image Retrieval and Classification Techniques
