Better RAG using Relevant Information Gain
Marc Pickett, Jeremy Hartman, Ayan Kumar Bhowmick, Raquib-ul Alam,, Aditya Vempaty

TL;DR
This paper introduces a new metric based on relevant information gain to improve retrieval diversity and relevance in RAG systems, leading to state-of-the-art question answering performance.
Contribution
The paper presents a novel optimization metric for retrieval that naturally balances relevance and diversity without explicit trade-offs.
Findings
Achieves state-of-the-art results on RGB benchmark
Outperforms existing relevance-diversity metrics
Enhances RAG performance with a simple optimization approach
Abstract
A common way to extend the memory of large language models (LLMs) is by retrieval augmented generation (RAG), which inserts text retrieved from a larger memory into an LLM's context window. However, the context window is typically limited to several thousand tokens, which limits the number of retrieved passages that can inform a model's response. For this reason, it's important to avoid occupying context window space with redundant information by ensuring a degree of diversity among retrieved passages. At the same time, the information should also be relevant to the current task. Most prior methods that encourage diversity among retrieved results, such as Maximal Marginal Relevance (MMR), do so by incorporating an objective that explicitly trades off diversity and relevance. We propose a novel simple optimization metric based on relevant information gain, a probabilistic measure of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSensor Technology and Measurement Systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Attention Dropout · Linear Warmup With Linear Decay · Residual Connection · Adam · Dropout · Byte Pair Encoding · Layer Normalization
