Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries
Sukhdeep S. Sodhi, Ellie Ka-In Chio, Ambarish Jash, Santiago, Onta\~n\'on, Ajit Apte, Ankit Kumar, Ayooluwakunmi Jeje, Dima Kuzmin, Harry, Fung, Heng-Tze Cheng, Jon Effrat, Tarush Bali, Nitin Jindal, Pei Cao,, Sarvjeet Singh, Senqiang Zhou, Tameen Khan, Amol Wankhede

TL;DR
Mondegreen is a post-processing method that corrects voice search queries in text form, improving user satisfaction without relying on audio signals, especially useful for on-device or privacy-sensitive applications.
Contribution
This paper introduces Mondegreen, a novel text-based correction approach for voice queries that enhances search relevance without depending on audio data.
Findings
Significant improvement in user interaction with corrected queries
Effective correction across multiple proprietary ASR systems
Complementary to existing ASR systems, addressing vocabulary drift
Abstract
As more and more online search queries come from voice, automatic speech recognition becomes a key component to deliver relevant search results. Errors introduced by automatic speech recognition (ASR) lead to irrelevant search results returned to the user, thus causing user dissatisfaction. In this paper, we introduce an approach, Mondegreen, to correct voice queries in text space without depending on audio signals, which may not always be available due to system constraints or privacy or bandwidth (for example, some ASR systems run on-device) considerations. We focus on voice queries transcribed via several proprietary commercial ASR systems. These queries come from users making internet, or online service search queries. We first present an analysis showing how different the language distribution coming from user voice queries is from that in traditional text corpora used to train…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
