Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation
Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang

TL;DR
This paper introduces SynCheck, a lightweight monitoring method for retrieval-augmented language models that detects unfaithful outputs in real-time, and proposes FOD, a decoding algorithm that significantly improves faithfulness in long-form generation.
Contribution
The paper presents SynCheck, a novel synchronous faithfulness monitor, and FOD, a faithfulness-oriented decoding algorithm, enhancing trustworthiness of retrieval-augmented language models.
Findings
SynCheck achieves 0.85 AUROC in detecting faithfulness errors.
FOD outperforms traditional decoding strategies with over 10% improvement.
The methods improve faithfulness across six long-form retrieval tasks.
Abstract
Retrieval-augmented language models (RALMs) have shown strong performance and wide applicability in knowledge-intensive tasks. However, there are significant trustworthiness concerns as RALMs are prone to generating unfaithful outputs, including baseless information or contradictions with the retrieved context. This paper proposes SynCheck, a lightweight monitor that leverages fine-grained decoding dynamics including sequence likelihood, uncertainty quantification, context influence, and semantic alignment to synchronously detect unfaithful sentences. By integrating efficiently measurable and complementary signals, SynCheck enables accurate and immediate feedback and intervention, achieving 0.85 AUROC in detecting faithfulness errors across six long-form retrieval-augmented generation tasks, improving prior best method by 4%. Leveraging SynCheck, we further introduce FOD, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCloud Computing and Resource Management · Caching and Content Delivery · Advanced Memory and Neural Computing
