Estimating Confusions in the ASR Channel for Improved Topic-based Language Model Adaptation
Damianos Karakos, Mark Dredze, Sanjeev Khudanpur

TL;DR
This paper introduces a model that estimates ASR channel confusions to enhance self-training for topic-based language model adaptation, leading to more accurate speech recognition in conversational settings.
Contribution
The work presents a novel approach to model ASR confusions, improving self-training effectiveness for language model adaptation over traditional 1-best and lattice methods.
Findings
Improved language model adaptation results on telephone conversations.
Enhanced self-training accuracy by modeling ASR confusions.
Outperforms traditional 1-best and lattice self-training methods.
Abstract
Human language is a combination of elemental languages/domains/styles that change across and sometimes within discourses. Language models, which play a crucial role in speech recognizers and machine translation systems, are particularly sensitive to such changes, unless some form of adaptation takes place. One approach to speech language model adaptation is self-training, in which a language model's parameters are tuned based on automatically transcribed audio. However, transcription errors can misguide self-training, particularly in challenging settings such as conversational speech. In this work, we propose a model that considers the confusions (errors) of the ASR channel. By modeling the likely confusions in the ASR output instead of using just the 1-best, we improve self-training efficacy by obtaining a more reliable reference transcription estimate. We demonstrate improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Topic Modeling · Natural Language Processing Techniques
