PCA Method for Automated Detection of Mispronounced Words

Zhenhao Ge; Sudhendu R. Sharma; Mark J. T. Smith

arXiv:1602.08128·cs.SD·February 29, 2016

PCA Method for Automated Detection of Mispronounced Words

Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

PDF

TL;DR

This paper introduces a PCA-based hierarchical method for detecting mispronounced words in language learning, achieving high accuracy and computational efficiency, especially with limited training data.

Contribution

It proposes a novel PCA-based hierarchical algorithm for mispronunciation detection, demonstrating superior efficiency and accuracy over traditional HMM-based methods in limited data scenarios.

Findings

01

Achieved up to 99% accuracy in word verification

02

Attained 93% accuracy in native/non-native classification

03

Outperformed HMMs in computational efficiency and effectiveness

Abstract

This paper presents a method for detecting mispronunciations with the aim of improving Computer Assisted Language Learning (CALL) tools used by foreign language learners. The algorithm is based on Principle Component Analysis (PCA). It is hierarchical with each successive step refining the estimate to classify the test word as being either mispronounced or correct. Preprocessing before detection, like normalization and time-scale modification, is implemented to guarantee uniformity of the feature vectors input to the detection system. The performance using various features including spectrograms and Mel-Frequency Cepstral Coefficients (MFCCs) are compared and evaluated. Best results were obtained using MFCCs, achieving up to 99% accuracy in word verification and 93% in native/non-native classification. Compared with Hidden Markov Models (HMMs) which are used pervasively in recognition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.