NN-grams: Unifying neural network and n-gram language models for Speech   Recognition

Babak Damavandi; Shankar Kumar; Noam Shazeer; Antoine Bruguier

arXiv:1606.07470·cs.CL·June 27, 2016

NN-grams: Unifying neural network and n-gram language models for Speech Recognition

Babak Damavandi, Shankar Kumar, Noam Shazeer, Antoine Bruguier

PDF

TL;DR

NN-grams is a hybrid language model that combines n-grams and neural networks, improving speech recognition performance by leveraging both memorization and generalization capabilities.

Contribution

The paper introduces NN-grams, a novel hybrid model that unifies n-gram counts with neural networks for efficient and effective speech recognition.

Findings

01

NN-grams outperforms traditional n-gram models on an Italian speech recognition task.

02

The model is trained efficiently using noise contrastive estimation without an output soft-max layer.

03

NN-grams effectively combine the strengths of n-grams and neural networks for language modeling.

Abstract

We present NN-grams, a novel, hybrid language model integrating n-grams and neural networks (NN) for speech recognition. The model takes as input both word histories as well as n-gram counts. Thus, it combines the memorization capacity and scalability of an n-gram model with the generalization ability of neural networks. We report experiments where the model is trained on 26B words. NN-grams are efficient at run-time since they do not include an output soft-max layer. The model is trained using noise contrastive estimation (NCE), an approach that transforms the estimation problem of neural networks into one of binary classification between data samples and noise samples. We present results with noise samples derived from either an n-gram distribution or from speech recognition lattices. NN-grams outperforms an n-gram model on an Italian speech recognition dictation task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.