Parameters Optimization for Improving ASR Performance in Adverse Real World Noisy Environmental Conditions
Urmila Shrawankar, Vilas Thakare

TL;DR
This paper explores optimizing parameters like window size, frame size, and overlap in ASR systems under noisy conditions, using fuzzy inference to improve accuracy in real-world environments.
Contribution
It introduces a fuzzy inference system to select optimal parameters for ASR performance in diverse noisy environments, enhancing robustness.
Findings
Variable parameters significantly affect ASR accuracy.
Fuzzy inference system effectively determines optimal parameter settings.
Improved recognition accuracy in adverse noise conditions.
Abstract
From the existing research it has been observed that many techniques and methodologies are available for performing every step of Automatic Speech Recognition (ASR) system, but the performance (Minimization of Word Error Recognition-WER and Maximization of Word Accuracy Rate- WAR) of the methodology is not dependent on the only technique applied in that method. The research work indicates that, performance mainly depends on the category of the noise, the level of the noise and the variable size of the window, frame, frame overlap etc is considered in the existing methods. The main aim of the work presented in this paper is to use variable size of parameters like window size, frame size and frame overlap percentage to observe the performance of algorithms for various categories of noise with different levels and also train the system for all size of parameters and category of real world…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Blind Source Separation Techniques
