Methods to integrate a language model with semantic information for a   word prediction component

Tonio Wandmacher; Jean-Yves Antoine

arXiv:0801.4716·cs.CL·January 31, 2008·38 cites

Methods to integrate a language model with semantic information for a word prediction component

Tonio Wandmacher, Jean-Yves Antoine

PDF

Open Access

TL;DR

This paper explores integrating Latent Semantic Analysis (LSA) with n-gram language models to improve word prediction accuracy by leveraging semantic dependencies.

Contribution

It introduces and evaluates methods combining LSA with standard language models, demonstrating significant performance improvements.

Findings

01

All proposed methods outperform the 4-gram baseline.

02

Most methods outperform a simple cache model.

03

Semantic integration enhances long-distance semantic dependency modeling.

Abstract

Most current word prediction systems make use of n-gram language models (LM) to estimate the probability of the following word in a phrase. In the past years there have been many attempts to enrich such language models with further syntactic or semantic information. We want to explore the predictive powers of Latent Semantic Analysis (LSA), a method that has been shown to provide reliable information on long-distance semantic dependencies between words in a context. We present and evaluate here several methods that integrate LSA-based information with a standard language model: a semantic cache, partial reranking, and different forms of interpolation. We found that all methods show significant improvements, compared to the 4-gram baseline, and most of them to a simple cache model as well.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis