Better RAG using Relevant Information Gain

Marc Pickett; Jeremy Hartman; Ayan Kumar Bhowmick; Raquib-ul Alam,; Aditya Vempaty

arXiv:2407.12101·cs.CL·February 14, 2025

Better RAG using Relevant Information Gain

Marc Pickett, Jeremy Hartman, Ayan Kumar Bhowmick, Raquib-ul Alam,, Aditya Vempaty

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new metric based on relevant information gain to improve retrieval diversity and relevance in RAG systems, leading to state-of-the-art question answering performance.

Contribution

The paper presents a novel optimization metric for retrieval that naturally balances relevance and diversity without explicit trade-offs.

Findings

01

Achieves state-of-the-art results on RGB benchmark

02

Outperforms existing relevance-diversity metrics

03

Enhances RAG performance with a simple optimization approach

Abstract

A common way to extend the memory of large language models (LLMs) is by retrieval augmented generation (RAG), which inserts text retrieved from a larger memory into an LLM's context window. However, the context window is typically limited to several thousand tokens, which limits the number of retrieved passages that can inform a model's response. For this reason, it's important to avoid occupying context window space with redundant information by ensuring a degree of diversity among retrieved passages. At the same time, the information should also be relevant to the current task. Most prior methods that encourage diversity among retrieved results, such as Maximal Marginal Relevance (MMR), do so by incorporating an objective that explicitly trades off diversity and relevance. We propose a novel simple optimization metric based on relevant information gain, a probabilistic measure of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EmergenceAI/dartboard
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSensor Technology and Measurement Systems

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Attention Dropout · Linear Warmup With Linear Decay · Residual Connection · Adam · Dropout · Byte Pair Encoding · Layer Normalization