A Non-monotonic Self-terminating Language Model

Eugene Choi; Kyunghyun Cho; Cheolhyoung Lee

arXiv:2210.00660·cs.LG·February 8, 2023

A Non-monotonic Self-terminating Language Model

Eugene Choi, Kyunghyun Cho, Cheolhyoung Lee

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a non-monotonic self-terminating language model that effectively prevents non-terminating sequences during decoding, improving the reliability of sequence generation in neural language models.

Contribution

It relaxes the monotonicity constraint of previous models, allowing for more flexible termination probabilities and ensuring termination with various decoding algorithms.

Findings

01

Prevents non-terminating sequences with incomplete decoding algorithms.

02

Proves effectiveness with beam search and other decoding methods.

03

Validated on sequence completion tasks across architectures.

Abstract

Recent large-scale neural autoregressive sequence models have shown impressive performances on a variety of natural language generation tasks. However, their generated sequences often exhibit degenerate properties such as non-termination, undesirable repetition, and premature termination, when generated with decoding algorithms such as greedy search, beam search, top- $k$ sampling, and nucleus sampling. In this paper, we focus on the problem of non-terminating sequences resulting from an incomplete decoding algorithm. We first define an incomplete probable decoding algorithm which includes greedy search, top- $k$ sampling, and nucleus sampling, beyond the incomplete decoding algorithm originally put forward by Welleck et al. (2020). We then propose a non-monotonic self-terminating language model, which significantly relaxes the constraint of monotonically increasing termination…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nyu-dl/non-monotonic-self-terminating-lm
pytorchOfficial

Videos

A Non-monotonic Self-terminating Language Model· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification