PURR: Efficiently Editing Language Model Hallucinations by Denoising   Language Model Corruptions

Anthony Chen; Panupong Pasupat; Sameer Singh; Hongrae Lee; Kelvin; Guu

arXiv:2305.14908·cs.CL·May 25, 2023·6 cites

PURR: Efficiently Editing Language Model Hallucinations by Denoising Language Model Corruptions

Anthony Chen, Panupong Pasupat, Sameer Singh, Hongrae Lee, Kelvin, Guu

PDF

Open Access

TL;DR

The paper introduces PURR, an unsupervised method that efficiently edits language model outputs to reduce hallucinations by training compact denoisers with artificially corrupted text, achieving faster and more accurate results.

Contribution

PURR is a novel unsupervised approach that trains compact editors to denoise corrupted text, significantly improving attribution and reducing hallucinations in language models.

Findings

01

PURR outperforms existing editing methods in attribution accuracy.

02

PURR achieves orders of magnitude faster inference times.

03

PURR effectively trains on domain-agnostic corrupted data.

Abstract

The remarkable capabilities of large language models have been accompanied by a persistent drawback: the generation of false and unsubstantiated claims commonly known as "hallucinations". To combat this issue, recent research has introduced approaches that involve editing and attributing the outputs of language models, particularly through prompt-based editing. However, the inference cost and speed of using large language models for editing currently bottleneck prompt-based methods. These bottlenecks motivate the training of compact editors, which is challenging due to the scarcity of training data for this purpose. To overcome these challenges, we exploit the power of large language models to introduce corruptions (i.e., noise) into text and subsequently fine-tune compact editors to denoise the corruptions by incorporating relevant evidence. Our methodology is entirely unsupervised and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings