Learning to Diversify Neural Text Generation via Degenerative Model
Jimin Hong, ChaeHun Park, Jaegul Choo

TL;DR
This paper introduces a novel method for improving diversity in neural text generation by training two models, one to identify undesirable patterns and another to focus on diverse patterns, leading to more informative outputs.
Contribution
It presents a new degenerative model approach that enhances diversity by leveraging models that learn undesirable and diverse patterns separately.
Findings
Improved diversity in language modeling and dialogue generation
Effective reduction of repetitive and overused words
Enhanced informativeness of generated texts
Abstract
Neural language models often fail to generate diverse and informative texts, limiting their applicability in real-world problems. While previous approaches have proposed to address these issues by identifying and penalizing undesirable behaviors (e.g., repetition, overuse of frequent words) from language models, we propose an alternative approach based on an observation: models primarily learn attributes within examples that are likely to cause degeneration problems. Based on this observation, we propose a new approach to prevent degeneration problems by training two models. Specifically, we first train a model that is designed to amplify undesirable patterns. We then enhance the diversity of the second model by focusing on patterns that the first model fails to learn. Extensive experiments on two tasks, namely language modeling and dialogue generation, demonstrate the effectiveness of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
Methodsfail
