Non-Autoregressive Neural Machine Translation: A Call for Clarity

Robin M. Schmidt; Telmo Pires; Stephan Peitz; Jonas L\"o\"of

arXiv:2205.10577·cs.CL·October 24, 2022·1 cites

Non-Autoregressive Neural Machine Translation: A Call for Clarity

Robin M. Schmidt, Telmo Pires, Stephan Peitz, Jonas L\"o\"of

PDF

Open Access

TL;DR

This paper critically evaluates non-autoregressive neural machine translation, emphasizing the importance of standardized benchmarks and providing insights to establish strong baselines for future research.

Contribution

It offers a comprehensive comparison of techniques for non-autoregressive translation, introduces standardized evaluation protocols, and provides open-source code for reproducibility.

Findings

01

Standardized BLEU, chrF++, and TER scores for four translation tasks.

02

Insights into effective baseline configurations using length prediction and CTC architectures.

03

Identification of inconsistencies in previous BLEU score reporting.

Abstract

Non-autoregressive approaches aim to improve the inference speed of translation models by only requiring a single forward pass to generate the output sequence instead of iteratively producing each predicted token. Consequently, their translation quality still tends to be inferior to their autoregressive counterparts due to several issues involving output token interdependence. In this work, we take a step back and revisit several techniques that have been proposed for improving non-autoregressive translation models and compare their combined translation quality and speed implications under third-party testing environments. We provide novel insights for establishing strong baselines using length prediction or CTC-based architecture variants and contribute standardized BLEU, chrF++, and TER scores using sacreBLEU on four translation tasks, which crucially have been missing as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Adversarial Robustness in Machine Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings