Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN
Muhammad Danyal Khan, Raheem Ali, Arshad Aziz

TL;DR
This paper presents a resource-efficient hybrid HMM and CNN-TDNN based speech recognition system for noisy telephonic environments handling code-switched Urdu and English, achieving low WER in call center scenarios.
Contribution
It introduces a novel framework combining Chain Hybrid HMM and CNN-TDNN for robust Urdu speech recognition in noisy, code-switched telephonic environments, addressing resource constraints.
Findings
Achieved 5.2% WER in noisy and clean environments
Demonstrated effectiveness of CNN-TDNN in noisy conditions
Utilized limited labeled data with hybrid neural network approach
Abstract
Call Centers have huge amount of audio data which can be used for achieving valuable business insights and transcription of phone calls is manually tedious task. An effective Automated Speech Recognition system can accurately transcribe these calls for easy search through call history for specific context and content allowing automatic call monitoring, improving QoS through keyword search and sentiment analysis. ASR for Call Center requires more robustness as telephonic environment are generally noisy. Moreover, there are many low-resourced languages that are on verge of extinction which can be preserved with help of Automatic Speech Recognition Technology. Urdu is the most widely spoken language in the world, with 231,295,440 worldwide still remains a resource constrained language in ASR. Regional call-center conversations operate in local language, with a mix of English…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Advanced Data Compression Techniques
