Flowchase: a Mobile Application for Pronunciation Training
No\'e Tits, Zo\'e Broisson

TL;DR
Flowchase is a mobile app that offers personalized, instant pronunciation feedback to English learners by analyzing speech segments and prosody using advanced speech technology and machine learning models.
Contribution
The paper introduces Flowchase, a novel mobile application integrating speech analysis and machine learning for real-time pronunciation training and feedback.
Findings
Effective segmentation and analysis of speech features.
Real-time, personalized pronunciation feedback.
Integration of machine learning models for speech recognition.
Abstract
In this paper, we present a solution for providing personalized and instant feedback to English learners through a mobile application, called Flowchase, that is connected to a speech technology able to segment and analyze speech segmental and supra-segmental features. The speech processing pipeline receives linguistic information corresponding to an utterance to analyze along with a speech sample. After validation of the speech sample, a joint forced-alignment and phonetic recognition is performed thanks to a combination of machine learning models based on speech representation learning that provides necessary information for designing a feedback on a series of segmental and supra-segmental pronunciation aspects.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and dialogue systems · Phonetics and Phonology Research
