Entity-Aware Language Model as an Unsupervised Reranker

Mohammad Sadegh Rasooli; Sarangarajan Parthasarathy

arXiv:1803.04291·cs.CL·June 19, 2018

Entity-Aware Language Model as an Unsupervised Reranker

Mohammad Sadegh Rasooli, Sarangarajan Parthasarathy

PDF

TL;DR

This paper introduces an unsupervised reranker that incorporates entity relationships and external knowledge-base features into language modeling, improving accuracy without needing manually annotated data.

Contribution

It proposes a contrastive estimation-based method for entity-aware reranking that eliminates the need for annotated n-best lists, enhancing language model performance.

Findings

01

Achieved 0.44% absolute WER reduction over baseline LSTM model.

02

Successfully integrated global features and knowledge-base information.

03

Demonstrated effectiveness in the music domain.

Abstract

In language modeling, it is difficult to incorporate entity relationships from a knowledge-base. One solution is to use a reranker trained with global features, in which global features are derived from n-best lists. However, training such a reranker requires manually annotated n-best lists, which is expensive to obtain. We propose a method based on the contrastive estimation method that alleviates the need for such data. Experiments in the music domain demonstrate that global features, as well as features extracted from an external knowledge-base, can be incorporated into our reranker. Our final model, a simple ensemble of a language model and reranker, achieves a 0.44\% absolute word error rate improvement over an LSTM language model on the blind test data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory