Spiral Language Modeling

Yong Cao; Yukun Feng; Shaohui Kuang; Gu Xu

arXiv:2112.10543·cs.CL·December 21, 2021

Spiral Language Modeling

Yong Cao, Yukun Feng, Shaohui Kuang, Gu Xu

PDF

Open Access

TL;DR

Spiral Language Modeling (SLM) introduces a flexible text generation approach that constructs sentences starting from arbitrary tokens, enhancing diversity, quality, and robustness in neural machine translation tasks.

Contribution

SLM enables non-linear text generation starting from any token, improving translation quality and robustness compared to traditional left-to-right methods.

Findings

01

Up to 4.7 BLEU score improvements in NMT tasks

02

Enhanced diversity and quality of generated text

03

Improved robustness in low-resource scenarios

Abstract

In almost all text generation applications, word sequences are constructed in a left-to-right (L2R) or right-to-left (R2L) manner, as natural language sentences are written either L2R or R2L. However, we find that the natural language written order is not essential for text generation. In this paper, we propose Spiral Language Modeling (SLM), a general approach that enables one to construct natural language sentences beyond the L2R and R2L order. SLM allows one to form natural language text by starting from an arbitrary token inside the result text and expanding the rest tokens around the selected ones. It makes the decoding order a new optimization objective besides the language model perplexity, which further improves the diversity and quality of the generated text. Furthermore, SLM makes it possible to manipulate the text construction process by selecting a proper starting token. SLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification