End-to-End Non-Autoregressive Neural Machine Translation with   Connectionist Temporal Classification

Jind\v{r}ich Libovick\'y; Jind\v{r}ich Helcl

arXiv:1811.04719·cs.CL·November 13, 2018·6 cites

End-to-End Non-Autoregressive Neural Machine Translation with Connectionist Temporal Classification

Jind\v{r}ich Libovick\'y, Jind\v{r}ich Helcl

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel end-to-end non-autoregressive neural machine translation model based on connectionist temporal classification, enabling faster inference while maintaining translation quality.

Contribution

It presents a new non-autoregressive architecture that can be trained end-to-end, unlike previous multi-step methods, improving translation speed without sacrificing quality.

Findings

01

Significant speedup over autoregressive models.

02

Translation quality comparable to other non-autoregressive models.

03

Effective on WMT English-Romanian and English-German datasets.

Abstract

Autoregressive decoding is the only part of sequence-to-sequence models that prevents them from massive parallelization at inference time. Non-autoregressive models enable the decoder to generate all output symbols independently in parallel. We present a novel non-autoregressive architecture based on connectionist temporal classification and evaluate it on the task of neural machine translation. Unlike other non-autoregressive methods which operate in several steps, our model can be trained end-to-end. We conduct experiments on the WMT English-Romanian and English-German datasets. Our models achieve a significant speedup over the autoregressive models, keeping the translation quality comparable to other non-autoregressive models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

m3yrin/nar-latent-alignment
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis