Learning to Diversify Neural Text Generation via Degenerative Model

Jimin Hong; ChaeHun Park; Jaegul Choo

arXiv:2309.12619·cs.CL·September 25, 2023

Learning to Diversify Neural Text Generation via Degenerative Model

Jimin Hong, ChaeHun Park, Jaegul Choo

PDF

Open Access

TL;DR

This paper introduces a novel method for improving diversity in neural text generation by training two models, one to identify undesirable patterns and another to focus on diverse patterns, leading to more informative outputs.

Contribution

It presents a new degenerative model approach that enhances diversity by leveraging models that learn undesirable and diverse patterns separately.

Findings

01

Improved diversity in language modeling and dialogue generation

02

Effective reduction of repetitive and overused words

03

Enhanced informativeness of generated texts

Abstract

Neural language models often fail to generate diverse and informative texts, limiting their applicability in real-world problems. While previous approaches have proposed to address these issues by identifying and penalizing undesirable behaviors (e.g., repetition, overuse of frequent words) from language models, we propose an alternative approach based on an observation: models primarily learn attributes within examples that are likely to cause degeneration problems. Based on this observation, we propose a new approach to prevent degeneration problems by training two models. Specifically, we first train a model that is designed to amplify undesirable patterns. We then enhance the diversity of the second model by focusing on patterns that the first model fails to learn. Extensive experiments on two tasks, namely language modeling and dialogue generation, demonstrate the effectiveness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methodsfail