Knowledge Verification to Nip Hallucination in the Bud
Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming, Shi

TL;DR
This paper introduces Knowledge Consistent Alignment (KCA), a method that reduces hallucinations in large language models by verifying and minimizing inconsistencies between external knowledge and the models' internal knowledge, showing improved results across multiple benchmarks.
Contribution
The paper proposes KCA, a novel approach that employs a well-aligned LLM to assess and reduce knowledge inconsistencies, effectively mitigating hallucinations in foundation LLMs.
Findings
KCA significantly reduces hallucinations across six benchmarks.
KCA is effective for various backbone LLMs and scales.
Open-source code and data are provided for reproducibility.
Abstract
While large language models (LLMs) have demonstrated exceptional performance across various tasks following human alignment, they may still generate responses that sound plausible but contradict factual knowledge, a phenomenon known as hallucination. In this paper, we demonstrate the feasibility of mitigating hallucinations by verifying and minimizing the inconsistency between external knowledge present in the alignment data and the intrinsic knowledge embedded within foundation LLMs. Specifically, we propose a novel approach called Knowledge Consistent Alignment (KCA), which employs a well-aligned LLM to automatically formulate assessments based on external knowledge to evaluate the knowledge boundaries of foundation LLMs. To address knowledge inconsistencies in the alignment data, KCA implements several specific strategies to deal with these data instances. We demonstrate the superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Healthcare
