Knowledge Verification to Nip Hallucination in the Bud

Fanqi Wan; Xinting Huang; Leyang Cui; Xiaojun Quan; Wei Bi; Shuming; Shi

arXiv:2401.10768·cs.CL·September 24, 2024·2 cites

Knowledge Verification to Nip Hallucination in the Bud

Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming, Shi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Knowledge Consistent Alignment (KCA), a method that reduces hallucinations in large language models by verifying and minimizing inconsistencies between external knowledge and the models' internal knowledge, showing improved results across multiple benchmarks.

Contribution

The paper proposes KCA, a novel approach that employs a well-aligned LLM to assess and reduce knowledge inconsistencies, effectively mitigating hallucinations in foundation LLMs.

Findings

01

KCA significantly reduces hallucinations across six benchmarks.

02

KCA is effective for various backbone LLMs and scales.

03

Open-source code and data are provided for reproducibility.

Abstract

While large language models (LLMs) have demonstrated exceptional performance across various tasks following human alignment, they may still generate responses that sound plausible but contradict factual knowledge, a phenomenon known as hallucination. In this paper, we demonstrate the feasibility of mitigating hallucinations by verifying and minimizing the inconsistency between external knowledge present in the alignment data and the intrinsic knowledge embedded within foundation LLMs. Specifically, we propose a novel approach called Knowledge Consistent Alignment (KCA), which employs a well-aligned LLM to automatically formulate assessments based on external knowledge to evaluate the knowledge boundaries of foundation LLMs. To address knowledge inconsistencies in the alignment data, KCA implements several specific strategies to deal with these data instances. We demonstrate the superior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fanqiwan/kca
pytorchOfficial

Videos

Knowledge Verification to Nip Hallucination in the Bud· underline

Taxonomy

TopicsMachine Learning in Healthcare