Evidence Contextualization and Counterfactual Attribution for Conversational QA over Heterogeneous Data with RAG Systems
Rishiraj Saha Roy, Joel Schlotthauer, Chris Hinze, Andreas Foltyn,, Luzian Hahn, Fabian Kuech

TL;DR
This paper introduces RAGONITE, a RAG system that enhances conversational QA by contextualizing evidence and providing causal counterfactual attributions, evaluated on a new bilingual enterprise wiki benchmark.
Contribution
The work presents RAGONITE, a novel RAG system with contextual evidence and counterfactual attribution, improving answer quality and explanation causality in enterprise data QA.
Findings
Contextualization improves retrieval and answer quality.
Counterfactual attribution provides better causal explanations.
Experiments validate the effectiveness on the ConfQuestions benchmark.
Abstract
Retrieval Augmented Generation (RAG) works as a backbone for interacting with an enterprise's own data via Conversational Question Answering (ConvQA). In a RAG system, a retriever fetches passages from a collection in response to a question, which are then included in the prompt of a large language model (LLM) for generating a natural language (NL) answer. However, several RAG systems today suffer from two shortcomings: (i) retrieved passages usually contain their raw text and lack appropriate document context, negatively impacting both retrieval and answering quality; and (ii) attribution strategies that explain answer generation typically rely only on similarity between the answer and the retrieved passages, thereby only generating plausible but not causal explanations. In this work, we demonstrate RAGONITE, a RAG system that remedies the above concerns by: (i) contextualizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Seismology and Earthquake Studies · Anomaly Detection Techniques and Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Attention Is All You Need · Dense Connections · Byte Pair Encoding · Multi-Head Attention · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay
