FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations
Mayank Ravishankara

TL;DR
FVA-RAG introduces a novel approach that inverts the retrieval process in RAG systems by explicitly seeking counter-evidence to test and reduce hallucinations, significantly improving factual accuracy.
Contribution
The paper presents FVA-RAG, a new pipeline that enhances retrieval-augmented generation by explicitly retrieving anti-evidence to mitigate sycophantic hallucinations.
Findings
FVA-RAG achieves over 80% accuracy on TruthfulQA-Generation benchmark.
It triggers falsification in about 25% of queries, effectively reducing hallucinations.
Outperforms existing methods like Self-RAG and CRAG with high statistical significance.
Abstract
Retrieval-Augmented Generation (RAG) reduces hallucinations by grounding answers in retrieved evidence, yet standard retrievers often exhibit retrieval sycophancy: they preferentially surface evidence that supports a user's premise, even when the premise is false. We propose FVA-RAG (Falsification-Verification Alignment RAG), a pipeline that inverts the standard RAG workflow by treating the initial response as a draft hypothesis and explicitly retrieving anti-context to stress-test it. We evaluate on the full TruthfulQA-Generation benchmark (N=817) under a fully frozen protocol with 0 live web calls and identical retrieval budgets across methods. Using gpt-4o for generation and deterministic judging, FVA-RAG achieves 79.80-80.05% accuracy across two independently built frozen corpora , significantly outperforming prompted variants of Self-RAG (71.11-72.22%) and CRAG (71.36-73.93%) with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Misinformation and Its Impacts
