Reducing LLM Hallucinations using Epistemic Neural Networks

Shreyas Verma; Kien Tran; Yusuf Ali; Guangyu Min

arXiv:2312.15576·cs.CL·December 27, 2023·1 cites

Reducing LLM Hallucinations using Epistemic Neural Networks

Shreyas Verma, Kien Tran, Yusuf Ali, Guangyu Min

PDF

Open Access

TL;DR

This paper proposes a novel approach using epistemic neural networks to improve uncertainty estimation in large language models, aiming to reduce hallucinations and enhance output reliability.

Contribution

First to train an epistemic neural network for next token prediction on a large language model to specifically address hallucination reduction.

Findings

01

Reduced hallucinations on the TruthfulQA dataset.

02

Improved uncertainty estimates for large language models.

Abstract

Reducing and detecting hallucinations in large language models is an open research problem. In this project, we attempt to leverage recent advances in the field of uncertainty estimation to reduce hallucinations in frozen large language models. Epistemic neural networks have recently been proposed to improve output joint distributions for large pre-trained models. ENNs are small networks attached to large, frozen models to improve the model's joint distributions and uncertainty estimates. In this work, we train an epistemic neural network on top of the Llama-2 7B model combined with a contrastive decoding feature enhancement technique. We are the first to train an ENN for the next token prediction task and explore the efficacy of this method in reducing hallucinations on the TruthfulQA dataset. In essence, we provide a method that leverages a pre-trained model's latent embeddings to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification