PCA Method for Automated Detection of Mispronounced Words
Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

TL;DR
This paper introduces a PCA-based hierarchical method for detecting mispronounced words in language learning, achieving high accuracy and computational efficiency, especially with limited training data.
Contribution
It proposes a novel PCA-based hierarchical algorithm for mispronunciation detection, demonstrating superior efficiency and accuracy over traditional HMM-based methods in limited data scenarios.
Findings
Achieved up to 99% accuracy in word verification
Attained 93% accuracy in native/non-native classification
Outperformed HMMs in computational efficiency and effectiveness
Abstract
This paper presents a method for detecting mispronunciations with the aim of improving Computer Assisted Language Learning (CALL) tools used by foreign language learners. The algorithm is based on Principle Component Analysis (PCA). It is hierarchical with each successive step refining the estimate to classify the test word as being either mispronounced or correct. Preprocessing before detection, like normalization and time-scale modification, is implemented to guarantee uniformity of the feature vectors input to the detection system. The performance using various features including spectrograms and Mel-Frequency Cepstral Coefficients (MFCCs) are compared and evaluated. Best results were obtained using MFCCs, achieving up to 99% accuracy in word verification and 93% in native/non-native classification. Compared with Hidden Markov Models (HMMs) which are used pervasively in recognition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
