Why Exposure Bias Matters: An Imitation Learning Perspective of Error   Accumulation in Language Generation

Kushal Arora; Layla El Asri; Hareesh Bahuleyan; Jackie Chi Kit Cheung

arXiv:2204.01171·cs.CL·January 11, 2023·1 cites

Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation

Kushal Arora, Layla El Asri, Hareesh Bahuleyan, Jackie Chi Kit Cheung

PDF

Open Access 1 Repo

TL;DR

This paper investigates how exposure bias in language models causes error accumulation, leading to issues like repetition and hallucinations, by analyzing it through an imitation learning lens and providing empirical evidence.

Contribution

It offers a novel analysis of exposure bias from an imitation learning perspective and demonstrates its impact on error accumulation in language generation.

Findings

01

Exposure bias causes error accumulation in language models.

02

Perplexity does not effectively measure error accumulation.

03

Error accumulation leads to poor generation quality.

Abstract

Current language generation models suffer from issues such as repetition, incoherence, and hallucinations. An often-repeated hypothesis is that this brittleness of generation models is caused by the training and the generation procedure mismatch, also referred to as exposure bias. In this paper, we verify this hypothesis by analyzing exposure bias from an imitation learning perspective. We show that exposure bias leads to an accumulation of errors, analyze why perplexity fails to capture this accumulation, and empirically show that this accumulation results in poor generation quality. Source code to reproduce these experiments is available at https://github.com/kushalarora/quantifying_exposure_bias

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kushalarora/quantifying_exposure_bias
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification