Automatic Pronunciation Assessment -- A Review

Yassine El Kheir; Ahmed Ali; and Shammur Absar Chowdhury

arXiv:2310.13974 · cs.CL · October 24, 2023 · 1 cites

Open Access

TL;DR

This paper reviews recent advances in automatic pronunciation assessment, covering phonemic and prosodic evaluation, highlighting challenges, limitations, resources, and future research directions in the field.

Contribution

It provides a comprehensive update on methods, challenges, and resources in pronunciation assessment, integrating recent deep learning approaches and categorizing research trends.

Findings

01 Identification of key challenges in pronunciation assessment

02 Analysis of existing limitations and resources

03 Discussion of future research directions

Abstract

Pronunciation assessment and its application in computer-aided pronunciation training (CAPT) have seen impressive progress in recent years. With the rapid growth in language processing and deep learning over the past few years, there is a need for an updated review. In this paper, we review methods employed in pronunciation assessment for both phonemic and prosodic aspects. We categorize the main challenges observed in prominent research trends, and highlight existing limitations and available resources. This is followed by a discussion of the remaining challenges and possible directions for future work.

Taxonomy

Topics: Speech Recognition and Synthesis · Phonetics and Phonology Research