Generation Constraint Scaling Can Mitigate Hallucination

Georgios Kollias; Payel Das; Subhajit Chaudhury

arXiv:2407.16908·cs.CL·July 25, 2024

Generation Constraint Scaling Can Mitigate Hallucination

Georgios Kollias, Payel Das, Subhajit Chaudhury

PDF

TL;DR

This paper proposes a geometry-inspired, training-free method to reduce hallucinations in large language models by scaling the readout vector, improving generation quality and efficiency.

Contribution

It introduces a novel scaling technique for memory-augmented LLMs that mitigates hallucinations without additional training or fine-tuning.

Findings

01

Outperforms state-of-the-art LLM editing methods

02

Reduces hallucinations in Wikipedia biography generation

03

Improves runtime efficiency

Abstract

Addressing the issue of hallucinations in large language models (LLMs) is a critical challenge. As the cognitive mechanisms of hallucination have been related to memory, here we explore hallucination for LLM that is enabled with explicit memory mechanisms. We empirically demonstrate that by simply scaling the readout vector that constrains generation in a memory-augmented LLM decoder, hallucination mitigation can be achieved in a training-free manner. Our method is geometry-inspired and outperforms a state-of-the-art LLM editing method on the task of generation of Wikipedia-like biography entries both in terms of generation quality and runtime complexity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.