Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented   Generation

Di Wu; Jia-Chen Gu; Fan Yin; Nanyun Peng; Kai-Wei Chang

arXiv:2406.13692·cs.CL·October 7, 2024·1 cites

Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation

Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces SynCheck, a lightweight monitoring method for retrieval-augmented language models that detects unfaithful outputs in real-time, and proposes FOD, a decoding algorithm that significantly improves faithfulness in long-form generation.

Contribution

The paper presents SynCheck, a novel synchronous faithfulness monitor, and FOD, a faithfulness-oriented decoding algorithm, enhancing trustworthiness of retrieval-augmented language models.

Findings

01

SynCheck achieves 0.85 AUROC in detecting faithfulness errors.

02

FOD outperforms traditional decoding strategies with over 10% improvement.

03

The methods improve faithfulness across six long-form retrieval tasks.

Abstract

Retrieval-augmented language models (RALMs) have shown strong performance and wide applicability in knowledge-intensive tasks. However, there are significant trustworthiness concerns as RALMs are prone to generating unfaithful outputs, including baseless information or contradictions with the retrieved context. This paper proposes SynCheck, a lightweight monitor that leverages fine-grained decoding dynamics including sequence likelihood, uncertainty quantification, context influence, and semantic alignment to synchronously detect unfaithful sentences. By integrating efficiently measurable and complementary signals, SynCheck enables accurate and immediate feedback and intervention, achieving 0.85 AUROC in detecting faithfulness errors across six long-form retrieval-augmented generation tasks, improving prior best method by 4%. Leveraging SynCheck, we further introduce FOD, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiaowu0162/sync-ralm-faithfulness
pytorchOfficial

Videos

Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation· underline

Taxonomy

TopicsCloud Computing and Resource Management · Caching and Content Delivery · Advanced Memory and Neural Computing