An EM Approach to Non-autoregressive Conditional Sequence Generation

Zhiqing Sun; Yiming Yang

arXiv:2006.16378·cs.LG·July 1, 2020·29 cites

An EM Approach to Non-autoregressive Conditional Sequence Generation

Zhiqing Sun, Yiming Yang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel EM-based training framework for non-autoregressive sequence generation that improves accuracy and reduces inference latency by jointly optimizing AR and NAR models.

Contribution

It is the first to apply an EM approach to NAR sequence generation, effectively addressing multi-modality issues and enhancing performance.

Findings

01

Achieves competitive or better translation quality than existing NAR models.

02

Significantly reduces inference latency in machine translation tasks.

03

Demonstrates effectiveness on benchmark datasets.

Abstract

Autoregressive (AR) models have been the dominating approach to conditional sequence generation, but are suffering from the issue of high inference latency. Non-autoregressive (NAR) models have been recently proposed to reduce the latency by generating all output tokens in parallel but could only achieve inferior accuracy compared to their autoregressive counterparts, primarily due to a difficulty in dealing with the multi-modality in sequence generation. This paper proposes a new approach that jointly optimizes both AR and NAR models in a unified Expectation-Maximization (EM) framework. In the E-step, an AR model learns to approximate the regularized posterior of the NAR model. In the M-step, the NAR model is updated on the new posterior and selects the training examples for the next AR model. This iterative process can effectively guide the system to remove the multi-modality in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

edward-sun/nat-em
tf

Videos

An EM Approach to Non-autoregressive Conditional Sequence Generation· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Machine Learning and Data Classification · Speech Recognition and Synthesis