Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance

Tingzhi Mao; Yerbolat Khassanov; Van Tung Pham; Haihua Xu; Hao Huang; Aishan Wumaier; Eng Siong Chng

arXiv:2010.12143·cs.SD·October 26, 2020

TL;DR

This paper introduces methods to enrich under-represented named-entities in speech recognition systems by augmenting training data, improving language models, and rescoring lattices, leading to better recognition of rare entities.

Contribution

It proposes a comprehensive approach combining exemplar utterances, enriched embeddings, and lattice rescoring to enhance recognition of under-represented named-entities in ASR.

Findings

01. Improved UR-NE occurrence in word lattices.

02. Enhanced recognition accuracy for under-represented entities.

03. Effective boosting of likelihood scores for UR-NEs.

Abstract

Automatic speech recognition (ASR) for under-represented named-entities (UR-NEs) is challenging because such named-entities (NEs) have too few instances and too little contextual coverage in the training data for reliable estimates and representations to be learned. In this paper, we propose approaches to enriching UR-NEs to improve speech recognition performance. Specifically, our first priority is to ensure that UR-NEs appear in the word lattice whenever they are present in the utterance. To this end, we create exemplar utterances for those UR-NEs according to their categories (e.g., location, person, organization), yielding an improved language model (LM) that boosts UR-NE occurrence in the word lattice. With more UR-NEs appearing in the lattice, we then boost recognition performance through lattice rescoring methods. We first enrich the representations of UR-NEs in a pre-trained recurrent neural network…
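
The exemplar-utterance step can be illustrated with a small sketch. The abstract only says that exemplar utterances are made per category, so everything below is an assumption for illustration rather than the authors' procedure: the carrier templates, the `<NE>` slot marker, the entity names, and the function name `make_exemplar_utterances` are all hypothetical. The idea shown is simply that each UR-NE is placed into sentences typical of its category, and the resulting text could then be added to the LM training data.

```python
import random

# Hypothetical carrier sentences per NE category; "<NE>" marks the entity slot.
TEMPLATES = {
    "location": ["i want to fly to <NE>", "how far is <NE> from here"],
    "person": ["please call <NE>"],
}

# Hypothetical under-represented entities, grouped by category.
UR_NES = {
    "location": ["zarvania"],
    "person": ["quillon"],
    "organization": ["acmecorp"],  # no templates for this category: skipped
}


def make_exemplar_utterances(templates, ur_nes, per_entity=2, seed=0):
    """Fill category-specific templates with each UR-NE of that category."""
    rng = random.Random(seed)  # fixed seed keeps the output reproducible
    utterances = []
    for category, entities in ur_nes.items():
        carriers = templates.get(category, [])
        if not carriers:
            continue  # nothing to generate without carrier sentences
        for entity in entities:
            # Use at most `per_entity` distinct templates per entity.
            k = min(per_entity, len(carriers))
            for carrier in rng.sample(carriers, k):
                utterances.append(carrier.replace("<NE>", entity))
    return utterances


utterances = make_exemplar_utterances(TEMPLATES, UR_NES)
for line in utterances:
    print(line)
```

Keeping the categories separate is the point of the design: a location name only ever lands in location-like contexts, which is what gives the LM plausible context for an entity it has rarely or never seen.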

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis