On Training Bi-directional Neural Network Language Model with Noise   Contrastive Estimation

Tianxing He; Yu Zhang; Jasha Droppo; Kai Yu

arXiv:1602.06064·cs.CL·February 26, 2016·1 cites

On Training Bi-directional Neural Network Language Model with Noise Contrastive Estimation

Tianxing He, Yu Zhang, Jasha Droppo, Kai Yu

PDF

Open Access 1 Repo

TL;DR

This paper explores training bi-directional neural network language models using noise contrastive estimation, showing improvements over traditional methods but not surpassing uni-directional models in a specific task.

Contribution

It introduces a novel training approach for bi-directional neural network language models using noise contrastive estimation.

Findings

01

NCE-trained bi-directional NNLM outperforms maximum likelihood trained models.

02

Bi-directional NNLM does not outperform baseline uni-directional NNLM.

03

The approach shows promise but has limitations in surpassing existing models.

Abstract

We propose to train bi-directional neural network language model(NNLM) with noise contrastive estimation(NCE). Experiments are conducted on a rescore task on the PTB data set. It is shown that NCE-trained bi-directional NNLM outperformed the one trained by conventional maximum likelihood training. But still(regretfully), it did not out-perform the baseline uni-directional NNLM.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://bitbucket.org/cloudygoose/ptb_rescore
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Topic Modeling · Natural Language Processing Techniques