Is Relevance Propagated from Retriever to Generator in RAG?
Fangzheng Tian, Debasis Ganguly, Craig Macdonald

TL;DR
This paper empirically investigates whether topical relevance of retrieved documents in RAG systems correlates with improved downstream task performance, revealing nuanced insights about relevance, context size, and retrieval model effectiveness.
Contribution
It provides the first empirical analysis of the relevance-utility relationship in RAG, focusing on topical overlap rather than answer containment.
Findings
Small positive correlation between relevance and utility.
Correlation decreases as context size increases.
Better retrieval models improve downstream RAG performance.
Abstract
Retrieval Augmented Generation (RAG) is a framework for incorporating external knowledge, usually in the form of a set of documents retrieved from a collection, as a part of a prompt to a large language model (LLM) to potentially improve the performance of a downstream task, such as question answering. Different from a standard retrieval task's objective of maximising the relevance of a set of top-ranked documents, a RAG system's objective is rather to maximise their total utility, where the utility of a document indicates whether including it as a part of the additional contextual information in an LLM prompt improves a downstream task. Existing studies investigate the role of the relevance of a RAG context for knowledge-intensive language tasks (KILT), where relevance essentially takes the form of answer containment. In contrast, in our work, relevance corresponds to that of topical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Dense Connections · Linear Warmup With Linear Decay · Linear Layer · BART · Layer Normalization · Attention Dropout · Residual Connection
