Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval-Augmented Generation
Tobias Leemann, Periklis Petridis, Giuseppe Vietri, Dionysis, Manousakas, Aaron Roth, Sergul Aydore

TL;DR
Auto-GDA introduces an unsupervised, iterative domain adaptation framework that enhances lightweight NLI models for more efficient grounding verification in RAG systems, reducing computational costs while maintaining high accuracy.
Contribution
It proposes a novel automatic generative domain adaptation method that improves NLI models for RAG without requiring labeled data or extensive fine-tuning.
Findings
Synthetic data generated by Auto-GDA improves NLI model performance.
Models fine-tuned on Auto-GDA data outperform teacher models.
Auto-GDA achieves comparable results to large LLMs at 10% of their computational cost.
Abstract
While retrieval-augmented generation (RAG) has been shown to enhance factuality of large language model (LLM) outputs, LLMs still suffer from hallucination, generating incorrect or irrelevant information. A common detection strategy involves prompting the LLM again to assess whether its response is grounded in the retrieved evidence, but this approach is costly. Alternatively, lightweight natural language inference (NLI) models for efficient grounding verification can be used at inference time. While existing pre-trained NLI models offer potential solutions, their performance remains subpar compared to larger models on realistic RAG inputs. RAG inputs are more complex than most datasets used for training NLI models and have characteristics specific to the underlying knowledge base, requiring adaptation of the NLI models to a specific target domain. Additionally, the lack of labeled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Speech and dialogue systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Attention Dropout · Linear Layer · Weight Decay · Linear Warmup With Linear Decay · Dropout · Byte Pair Encoding · BERT
