Delta -- Contrastive Decoding Mitigates Text Hallucinations in Large   Language Models

Cheng Peng Huang; Hao-Yuan Chen

arXiv:2502.05825·cs.CL·February 19, 2025

Delta -- Contrastive Decoding Mitigates Text Hallucinations in Large Language Models

Cheng Peng Huang, Hao-Yuan Chen

PDF

Open Access

TL;DR

Delta is an inference-time technique that reduces hallucinations in large language models by contrasting output distributions with masked inputs, improving factual accuracy without retraining.

Contribution

The paper introduces Delta, a novel inference-only method that mitigates hallucinations in LLMs by contrasting outputs for original and masked prompts, requiring no retraining or extra data.

Findings

01

Improves SQuAD v1.1 and v2 accuracy by 3-6 percentage points.

02

Enhances TriviaQA and Natural Questions scores by 2-7 points.

03

Increases SQuAD v2 no-answer exact match by over 10 percentage points.

Abstract

Large language models (LLMs) demonstrate strong capabilities in natural language processing but remain prone to hallucinations, generating factually incorrect or fabricated content. This issue undermines their reliability, particularly in high-stakes domains such as healthcare and legal advisory. To address this challenge, we propose Delta, an inference-time method that reduces hallucinations without requiring model retraining or additional data. Delta works by randomly masking parts of the input prompt and contrasting the output distributions for the original and masked inputs, effectively suppressing hallucinations through inference-only computations. We evaluate Delta on context-rich question-answering benchmarks, achieving absolute improvements of approximately 3 and 6 percentage points on SQuAD v1.1 and v2, respectively, and 7 and 2 percentage points on TriviaQA and Natural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Machine Learning in Healthcare