$\textit{latent}$-GLAT: Glancing at Latent Variables for Parallel Text   Generation

Yu Bao; Hao Zhou; Shujian Huang; Dongqi Wang; Lihua Qian; Xinyu Dai,; Jiajun Chen; Lei Li

arXiv:2204.02030·cs.CL·April 6, 2022·1 cites

$\textit{latent}$-GLAT: Glancing at Latent Variables for Parallel Text Generation

Yu Bao, Hao Zhou, Shujian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai,, Jiajun Chen, Lei Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces $ extit{latent}$-GLAT, a parallel text generation method using discrete latent variables and curriculum learning, which improves generation quality without relying on autoregressive models, expanding application possibilities.

Contribution

It proposes a novel parallel text generation approach with discrete latent variables and curriculum learning, eliminating the need for autoregressive training.

Findings

01

Outperforms strong baselines in quality

02

Does not require autoregressive models for training

03

Broadens application scenarios of parallel decoding

Abstract

Recently, parallel text generation has received widespread attention due to its success in generation efficiency. Although many advanced techniques are proposed to improve its generation quality, they still need the help of an autoregressive model for training to overcome the one-to-many multi-modal phenomenon in the dataset, limiting their applications. In this paper, we propose $latent$ -GLAT, which employs the discrete latent variables to capture word categorical information and invoke an advanced curriculum learning technique, alleviating the multi-modality problem. Experiment results show that our method outperforms strong baselines without the help of an autoregressive model, which further broadens the application scenarios of the parallel decoding paradigm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

baoy-nlp/latent-glat
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications