Iter-AHMCL: Alleviate Hallucination for Large Language Model via   Iterative Model-level Contrastive Learning

Huiwen Wu; Xiaohan Li; Xiaogang Xu; Jiafei Wu; Deyi Zhang; Zhe Liu

arXiv:2410.12130·cs.CL·October 17, 2024

Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning

Huiwen Wu, Xiaohan Li, Xiaogang Xu, Jiafei Wu, Deyi Zhang, Zhe Liu

PDF

Open Access

TL;DR

This paper proposes Iter-AHMCL, an iterative contrastive learning method that reduces hallucinations in large language models by contrasting models trained on hallucinated and non-hallucinated data, improving factual accuracy.

Contribution

It introduces a novel iterative contrastive learning approach at the model level to effectively mitigate hallucinations in pre-trained LLMs without sacrificing their capabilities.

Findings

01

Achieves an average of 10.1 points improvement on TruthfulQA benchmark.

02

Effectively reduces hallucinations while maintaining LLM performance.

03

Validated on four different pre-trained foundation LLMs.

Abstract

The development of Large Language Models (LLMs) has significantly advanced various AI applications in commercial and scientific research fields, such as scientific literature summarization, writing assistance, and knowledge graph construction. However, a significant challenge is the high risk of hallucination during LLM inference, which can lead to security concerns like factual inaccuracies, inconsistent information, and fabricated content. To tackle this issue, it is essential to develop effective methods for reducing hallucination while maintaining the original capabilities of the LLM. This paper introduces a novel approach called Iterative Model-level Contrastive Learning (Iter-AHMCL) to address hallucination. This method modifies the representation layers of pre-trained LLMs by using contrastive `positive' and `negative' models, trained on data with and without hallucinations. By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Tuberculosis Research and Epidemiology

MethodsContrastive Learning