Iterative Refinement in the Continuous Space for Non-Autoregressive   Neural Machine Translation

Jason Lee; Raphael Shu; Kyunghyun Cho

arXiv:2009.07177·cs.CL·September 16, 2020·1 cites

Iterative Refinement in the Continuous Space for Non-Autoregressive Neural Machine Translation

Jason Lee, Raphael Shu, Kyunghyun Cho

PDF

Open Access 1 Repo

TL;DR

This paper introduces an efficient continuous-space iterative refinement method for non-autoregressive neural machine translation, improving speed and translation quality over previous approaches by optimizing in a low-dimensional latent space.

Contribution

The authors propose a novel gradient-based inference procedure in continuous latent space for non-autoregressive translation, outperforming existing hybrid-space methods in efficiency and effectiveness.

Findings

01

Faster decoding compared to EM-like inference methods.

02

Higher BLEU scores with the same number of refinement steps.

03

Decodes 6.2 times faster than autoregressive models with minimal quality loss.

Abstract

We propose an efficient inference procedure for non-autoregressive machine translation that iteratively refines translation purely in the continuous space. Given a continuous latent variable model for machine translation (Shu et al., 2020), we train an inference network to approximate the gradient of the marginal log probability of the target sentence, using only the latent variable as input. This allows us to use gradient-based optimization to find the target sentence at inference time that approximately maximizes its marginal probability. As each refinement step only involves computation in the latent space of low dimensionality (we use 8 in our experiments), we avoid computational overhead incurred by existing non-autoregressive inference procedures that often refine in token space. We compare our approach to a recently proposed EM-like inference procedure (Shu et al., 2020) that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zomux/lanmt-ebm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification