Iterative Refinement in the Continuous Space for Non-Autoregressive Neural Machine Translation
Jason Lee, Raphael Shu, Kyunghyun Cho

TL;DR
This paper introduces an efficient continuous-space iterative refinement method for non-autoregressive neural machine translation, improving speed and translation quality over previous approaches by optimizing in a low-dimensional latent space.
Contribution
The authors propose a novel gradient-based inference procedure in continuous latent space for non-autoregressive translation, outperforming existing hybrid-space methods in efficiency and effectiveness.
Findings
Faster decoding compared to EM-like inference methods.
Higher BLEU scores with the same number of refinement steps.
Decodes 6.2 times faster than autoregressive models with minimal quality loss.
Abstract
We propose an efficient inference procedure for non-autoregressive machine translation that iteratively refines translation purely in the continuous space. Given a continuous latent variable model for machine translation (Shu et al., 2020), we train an inference network to approximate the gradient of the marginal log probability of the target sentence, using only the latent variable as input. This allows us to use gradient-based optimization to find the target sentence at inference time that approximately maximizes its marginal probability. As each refinement step only involves computation in the latent space of low dimensionality (we use 8 in our experiments), we avoid computational overhead incurred by existing non-autoregressive inference procedures that often refine in token space. We compare our approach to a recently proposed EM-like inference procedure (Shu et al., 2020) that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
