SMART-RAG: Selection using Determinantal Matrices for Augmented Retrieval
Jiatao Li, Xinyu Hu, Xiaojun Wan

TL;DR
SMART-RAG introduces a novel unsupervised, training-free method using Determinantal Point Processes to improve context selection in Retrieval-Augmented Generation, reducing redundancy and conflicting information for better question answering performance.
Contribution
It presents SMART, a new DPP-based framework that optimizes context selection in RAG without training, addressing redundancy and conflict issues in unsupervised retrieval.
Findings
Significantly improves QA performance over previous methods.
Effectively balances relevance, diversity, and conflict in context selection.
Outperforms existing unsupervised context selection approaches.
Abstract
Retrieval-Augmented Generation (RAG) has greatly improved large language models (LLMs) by enabling them to generate accurate, contextually grounded responses through the integration of external information. However, conventional RAG approaches, which prioritize top-ranked documents based solely on query-context relevance, often introduce redundancy and conflicting information. This issue is particularly evident in unsupervised retrieval settings, where there are no mechanisms to effectively mitigate these problems, leading to suboptimal context selection. To address this, we propose Selection using Matrices for Augmented Retrieval (SMART) in question answering tasks, a fully unsupervised and training-free framework designed to optimize context selection in RAG. SMART leverages Determinantal Point Processes (DPPs) to simultaneously model relevance, diversity and conflict, ensuring the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Neural Networks and Applications · Advanced Image and Video Retrieval Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Dense Connections · Multi-Head Attention · Linear Warmup With Linear Decay · Weight Decay · Adam · WordPiece
