Adaptive Retrieval-Augmented Generation for Conversational Systems
Xi Wang, Procheta Sen, Ruizhe Li, Emine Yilmaz

TL;DR
This paper introduces RAGate, a gating model that predicts when a conversational system should use external knowledge retrieval to improve response quality, optimizing the use of retrieval-augmented generation in dialogue systems.
Contribution
The study proposes RAGate, a novel adaptive gating mechanism that determines the necessity of retrieval augmentation for each conversational turn, enhancing response relevance and confidence.
Findings
RAGate effectively identifies when to use retrieval augmentation.
High correlation between generation confidence and relevance of augmented knowledge.
Improved response quality with adaptive retrieval control.
Abstract
Despite the success of integrating large language models into the development of conversational systems, many studies have shown the effectiveness of retrieving and augmenting external knowledge for informative responses. Hence, many existing studies commonly assume the always need for Retrieval Augmented Generation (RAG) in a conversational system without explicit control. This raises a research question about such a necessity. In this study, we propose to investigate the need for each turn of system response to be augmented with external knowledge. In particular, by leveraging human judgements on the binary choice of adaptive augmentation, we develop RAGate, a gating model, which models conversation context and relevant inputs to predict if a conversational system requires RAG for improved responses. We conduct extensive experiments on devising and applying RAGate to conversational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSpeech and dialogue systems · Advanced Text Analysis Techniques · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Softmax · Dense Connections · Dropout · Linear Layer · Attention Dropout · Residual Connection · Linear Warmup With Linear Decay
