Imputer: Sequence Modelling via Imputation and Dynamic Programming

William Chan; Chitwan Saharia; Geoffrey Hinton; Mohammad Norouzi; Navdeep Jaitly

arXiv:2002.08926 · eess.AS · April 23, 2020 · 23 cites

TL;DR

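The paper introduces the Imputer, a neural sequence model that generates output sequences through iterative imputations in a constant number of steps. It is trained with a tractable dynamic programming algorithm and, on speech recognition, outperforms prior non-autoregressive models while remaining competitive with autoregressive ones.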

Contribution

It proposes a novel iterative generative model with a dynamic programming training algorithm that marginalizes over alignments and generation orders.

Findings

01. Outperforms prior non-autoregressive models in speech recognition

02. Achieves 11.1 WER on LibriSpeech test-other, outperforming CTC (13.0 WER) and seq2seq (12.5 WER)

03. Requires only a constant number of generation steps, independent of the number of input or output tokens
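
To make the constant-step claim concrete, here is a minimal sketch of one way such a decoder could work. The block-wise schedule, the names `impute`, `toy_scorer`, `MASK`, and `block_size`, and the random scorer standing in for the neural network are all assumptions for illustration; this page does not describe the paper's actual decoding procedure. The idea sketched: split the alignment into fixed-size blocks and, at every step, commit the single most confident proposal in each block, so the number of steps depends on the block size and not on the sequence length.

```python
import random

MASK = None  # hypothetical marker for a position that has not been imputed yet


def toy_scorer(alignment, vocab, rng):
    """Stand-in for the neural network (assumption): for every masked
    position, propose a token together with a confidence score."""
    return {
        i: (rng.choice(vocab), rng.random())
        for i, tok in enumerate(alignment)
        if tok is MASK
    }


def impute(length, block_size, vocab, seed=0):
    """Fill a fully masked alignment, one position per block per step."""
    rng = random.Random(seed)
    alignment = [MASK] * length
    steps = 0
    while any(tok is MASK for tok in alignment):
        proposals = toy_scorer(alignment, vocab, rng)
        # In every block, commit only the most confident proposal.
        for start in range(0, length, block_size):
            end = min(start + block_size, length)
            masked = [i for i in range(start, end) if i in proposals]
            if masked:
                best = max(masked, key=lambda i: proposals[i][1])
                alignment[best] = proposals[best][0]
        steps += 1
    return alignment, steps


if __name__ == "__main__":
    for n in (16, 64, 256):
        _, steps = impute(n, block_size=4, vocab=["_", "a", "b"])
        print(n, steps)
```

With `block_size=4`, the loop finishes in 4 steps for all three lengths, since each step fills one position in every block in parallel.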

Abstract

This paper presents the Imputer, a neural sequence model that generates output sequences iteratively via imputations. The Imputer is an iterative generative model, requiring only a constant number of generation steps independent of the number of input or output tokens. The Imputer can be trained to approximately marginalize over all possible alignments between the input and output sequences, and all possible generation orders. We present a tractable dynamic programming training algorithm, which yields a lower bound on the log marginal likelihood. When applied to end-to-end speech recognition, the Imputer outperforms prior non-autoregressive models and achieves competitive results to autoregressive models. On LibriSpeech test-other, the Imputer achieves 11.1 WER, outperforming CTC at 13.0 WER and seq2seq at 12.5 WER.
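
As a rough illustration of what "marginalizing over all possible alignments" with dynamic programming means, the sketch below implements the standard CTC forward recursion, not the Imputer's own training algorithm. The Imputer's objective additionally covers generation orders and yields a lower bound, neither of which is modelled here. The names `marginal_likelihood`, `brute_force`, and `collapse`, the `"_"` blank symbol, and the per-frame probability tables are assumptions chosen for the example.

```python
from itertools import product

BLANK = "_"  # assumed blank symbol for CTC-style alignments


def collapse(alignment):
    """CTC-style collapse: merge repeated tokens, then drop blanks."""
    out, prev = [], None
    for tok in alignment:
        if tok != prev and tok != BLANK:
            out.append(tok)
        prev = tok
    return "".join(out)


def marginal_likelihood(probs, target):
    """Sum the probability of every alignment that collapses to `target`.

    probs[t][token] is the per-frame probability of `token` at frame t.
    Runs in O(T * len(target)) instead of enumerating all alignments.
    """
    ext = [BLANK]
    for ch in target:
        ext += [ch, BLANK]  # interleave blanks around target tokens
    alpha = [0.0] * len(ext)
    alpha[0] = probs[0][ext[0]]
    if len(ext) > 1:
        alpha[1] = probs[0][ext[1]]
    for t in range(1, len(probs)):
        new = [0.0] * len(ext)
        for s, tok in enumerate(ext):
            total = alpha[s]                      # stay on the same symbol
            if s >= 1:
                total += alpha[s - 1]             # advance by one
            if s >= 2 and tok != BLANK and tok != ext[s - 2]:
                total += alpha[s - 2]             # skip the blank in between
            new[s] = total * probs[t][tok]
        alpha = new
    return alpha[-1] + (alpha[-2] if len(ext) > 1 else 0.0)


def brute_force(probs, target):
    """Reference answer by explicit enumeration (exponential in T)."""
    vocab = list(probs[0])
    total = 0.0
    for path in product(vocab, repeat=len(probs)):
        if collapse(path) == target:
            p = 1.0
            for t, tok in enumerate(path):
                p *= probs[t][tok]
            total += p
    return total
```

On small inputs the recursion can be checked against `brute_force`, which enumerates every alignment explicitly; the two should agree up to floating-point error.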

Code & Models

Repositories

rosinality/imputer-pytorch
pytorch

Videos

Imputer: Sequence Modelling via Imputation and Dynamic Programming · youtube

Imputer: Sequence Modelling via Imputation and Dynamic Programming · slideslive

Taxonomy

Topics: Speech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence