Improving Fluency of Non-Autoregressive Machine Translation

Zden\v{e}k Kasner; Jind\v{r}ich Libovick\'y; Jind\v{r}ich Helcl

arXiv:2004.03227·cs.CL·April 8, 2020·6 cites

Improving Fluency of Non-Autoregressive Machine Translation

Zden\v{e}k Kasner, Jind\v{r}ich Libovick\'y, Jind\v{r}ich Helcl

PDF

Open Access

TL;DR

This paper enhances the fluency of non-autoregressive machine translation models using CTC and additional features, maintaining high decoding speed and achieving competitive BLEU scores across multiple language pairs.

Contribution

It introduces a method to improve nAR translation fluency with CTC and feature-enhanced scoring, preserving decoding speed advantages.

Findings

01

Improved fluency of nAR models with CTC and feature scoring.

02

Decoding speed remains higher than AR models.

03

Achieved competitive BLEU scores on multiple language pairs.

Abstract

Non-autoregressive (nAR) models for machine translation (MT) manifest superior decoding speed when compared to autoregressive (AR) models, at the expense of impaired fluency of their outputs. We improve the fluency of a nAR model with connectionist temporal classification (CTC) by employing additional features in the scoring model used during beam search decoding. Since the beam search decoding in our model only requires to run the network in a single forward pass, the decoding speed is still notably higher than in standard AR models. We train models for three language pairs: German, Czech, and Romanian from and into English. The results show that our proposed models can be more efficient in terms of decoding speed and still achieve a competitive BLEU score relative to AR models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings