Recognize Foreign Low-Frequency Words with Similar Pairs

Xi Ma; Xiaoxi Wang; Dong Wang; Zhiyong Zhang

arXiv:1506.04940·cs.CL·June 17, 2015

Recognize Foreign Low-Frequency Words with Similar Pairs

Xi Ma, Xiaoxi Wang, Dong Wang, Zhiyong Zhang

PDF

Open Access

TL;DR

This paper improves automatic speech recognition of low-frequency and out-of-language words by extending a word-pair method that leverages information from multiple predicting words, enhancing probability estimates.

Contribution

It introduces an extension of the word-pair approach using multiple predicting words and applies it to multilingual speech recognition for better low-frequency word recognition.

Findings

01

Enhanced recognition accuracy for low-frequency words.

02

Effective handling of out-of-language words in multilingual ASR.

03

Improved probability estimation for rare words.

Abstract

Low-frequency words place a major challenge for automatic speech recognition (ASR). The probabilities of these words, which are often important name entities, are generally under-estimated by the language model (LM) due to their limited occurrences in the training data. Recently, we proposed a word-pair approach to deal with the problem, which borrows information of frequent words to enhance the probabilities of low-frequency words. This paper presents an extension to the word-pair method by involving multiple `predicting words' to produce better estimation for low-frequency words. We also employ this approach to deal with out-of-language words in the task of multi-lingual speech recognition.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling