Recognition Performance of a Structured Language Model

Ciprian Chelba; Frederick Jelinek (CLSP The Johns Hopkins University)

arXiv:cs/0001022·cs.CL·May 23, 2007·32 cites

Recognition Performance of a Structured Language Model

Ciprian Chelba, Frederick Jelinek (CLSP The Johns Hopkins University)

PDF

Open Access

TL;DR

This paper introduces a structured language model that builds hierarchical representations of word sequences to improve speech recognition accuracy, demonstrating better perplexity and word error rates than traditional trigram models.

Contribution

The paper presents a novel hierarchical language model inspired by linguistic analysis, enhancing speech recognition by capturing long-distance dependencies.

Findings

01

Improved perplexity over trigram models

02

Reduced word error rate in speech recognition

03

Effective use of hierarchical structure in language modeling

Abstract

A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history - thus enabling the use of extended distance dependencies - in an attempt to complement the locality of currently used trigram models. The structured language model, its probabilistic parameterization and performance in a two-pass speech recognizer are presented. Experiments on the SWITCHBOARD corpus show an improvement in both perplexity and word error rate over conventional trigram models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Bayesian Methods and Mixture Models