Factuality Enhanced Language Models for Open-Ended Text Generation

Nayeon Lee; Wei Ping; Peng Xu; Mostofa Patwary; Pascale Fung; Mohammad; Shoeybi; Bryan Catanzaro

arXiv:2206.04624·cs.CL·March 3, 2023·54 cites

Factuality Enhanced Language Models for Open-Ended Text Generation

Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Pascale Fung, Mohammad, Shoeybi, Bryan Catanzaro

PDF

Open Access 5 Repos 1 Video

TL;DR

This paper introduces methods to measure and enhance the factual accuracy of large language models in open-ended text generation, including new benchmarks, sampling algorithms, and training techniques that significantly reduce factual errors.

Contribution

It presents the FactualityPrompts benchmark, a factual-nucleus sampling algorithm, and a factuality-enhanced training method to improve the factual correctness of language models.

Findings

01

Larger LMs tend to be more factual than smaller ones.

02

Factual-nucleus sampling improves factuality without sacrificing quality.

03

Factuality-enhanced training reduces factual errors significantly.

Abstract

Pretrained language models (LMs) are susceptible to generate text with nonfactual information. In this work, we measure and improve the factual accuracy of large-scale LMs for open-ended text generation. We design the FactualityPrompts test set and metrics to measure the factuality of LM generations. Based on that, we study the factual accuracy of LMs with parameter sizes ranging from 126M to 530B. Interestingly, we find that larger LMs are more factual than smaller ones, although a previous study suggests that larger LMs can be less truthful in terms of misconceptions. In addition, popular sampling algorithms (e.g., top-p) in open-ended text generation can harm the factuality due to the ''uniform randomness'' introduced at every sampling step. We propose the factual-nucleus sampling algorithm that dynamically adapts the randomness to improve the factuality of generation while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Factuality Enhanced Language Models for Open-Ended Text Generation· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research

MethodsTest