Expoiting Syntactic Structure for Language Modeling

Ciprian Chelba; Frederick Jelinek (CLSP The Johns Hopkins University)

arXiv:cs/9811022·cs.CL·May 23, 2007·124 cites

Expoiting Syntactic Structure for Language Modeling

Ciprian Chelba, Frederick Jelinek (CLSP The Johns Hopkins University)

PDF

Open Access

TL;DR

This paper introduces a language model that learns syntactic structures to better capture long-distance dependencies, improving prediction accuracy over traditional trigram models and suitable for speech recognition applications.

Contribution

It presents a novel syntactic structure-based language model that integrates parse structures into probability estimation, enhancing long-range dependency modeling.

Findings

01

Improved prediction accuracy over trigram models

02

Effective use of syntactic structures for language modeling

03

Model suitable for speech recognition tasks

Abstract

The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint sequence of words--binary-parse-structure with headword annotation and operates in a left-to-right manner --- therefore usable for automatic speech recognition. The model, its probabilistic parameterization, and a set of experiments meant to evaluate its predictive power are presented; an improvement over standard trigram modeling is achieved.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems