A Non-monotonic Self-terminating Language Model
Eugene Choi, Kyunghyun Cho, Cheolhyoung Lee

TL;DR
This paper introduces a non-monotonic self-terminating language model that effectively prevents non-terminating sequences during decoding, improving the reliability of sequence generation in neural language models.
Contribution
It relaxes the monotonicity constraint of previous models, allowing for more flexible termination probabilities and ensuring termination with various decoding algorithms.
Findings
Prevents non-terminating sequences with incomplete decoding algorithms.
Proves effectiveness with beam search and other decoding methods.
Validated on sequence completion tasks across architectures.
Abstract
Recent large-scale neural autoregressive sequence models have shown impressive performances on a variety of natural language generation tasks. However, their generated sequences often exhibit degenerate properties such as non-termination, undesirable repetition, and premature termination, when generated with decoding algorithms such as greedy search, beam search, top- sampling, and nucleus sampling. In this paper, we focus on the problem of non-terminating sequences resulting from an incomplete decoding algorithm. We first define an incomplete probable decoding algorithm which includes greedy search, top- sampling, and nucleus sampling, beyond the incomplete decoding algorithm originally put forward by Welleck et al. (2020). We then propose a non-monotonic self-terminating language model, which significantly relaxes the constraint of monotonically increasing termination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
