Flowchase: a Mobile Application for Pronunciation Training

No\'e Tits; Zo\'e Broisson

arXiv:2307.02051·eess.AS·July 6, 2023·2 cites

Flowchase: a Mobile Application for Pronunciation Training

No\'e Tits, Zo\'e Broisson

PDF

Open Access

TL;DR

Flowchase is a mobile app that offers personalized, instant pronunciation feedback to English learners by analyzing speech segments and prosody using advanced speech technology and machine learning models.

Contribution

The paper introduces Flowchase, a novel mobile application integrating speech analysis and machine learning for real-time pronunciation training and feedback.

Findings

01

Effective segmentation and analysis of speech features.

02

Real-time, personalized pronunciation feedback.

03

Integration of machine learning models for speech recognition.

Abstract

In this paper, we present a solution for providing personalized and instant feedback to English learners through a mobile application, called Flowchase, that is connected to a speech technology able to segment and analyze speech segmental and supra-segmental features. The speech processing pipeline receives linguistic information corresponding to an utterance to analyze along with a speech sample. After validation of the speech sample, a joint forced-alignment and phonetic recognition is performed thanks to a combination of machine learning models based on speech representation learning that provides necessary information for designing a feedback on a series of segmental and supra-segmental pronunciation aspects.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and dialogue systems · Phonetics and Phonology Research