Who Said What? An Automated Approach to Analyzing Speech in Preschool Classrooms
Anchen Sun, Juan J Londono, Batya Elbaum, Luis Estrada, Roberto Jose, Lazo, Laura Vitale, Hugo Gonzalez Villasanti, Riccardo Fusaroli, Lynn K, Perry, Daniel S Messinger

TL;DR
This study introduces an automated speech analysis framework for preschool classrooms using open source tools, enabling efficient classification and transcription of child and teacher speech to support language development research.
Contribution
It presents a novel automated framework combining speaker classification and transcription for preschool classroom recordings, validated against expert annotations.
Findings
Achieved 76% agreement with human expert in speaker classification.
Word error rate of 15% for transcriptions, comparable to human transcriptions.
Speech feature analysis showed consistency between automated and expert transcriptions.
Abstract
Young children spend substantial portions of their waking hours in noisy preschool classrooms. In these environments, children's vocal interactions with teachers are critical contributors to their language outcomes, but manually transcribing these interactions is prohibitive. Using audio from child- and teacher-worn recorders, we propose an automated framework that uses open source software both to classify speakers (ALICE) and to transcribe their utterances (Whisper). We compare results from our framework to those from a human expert for 110 minutes of classroom recordings, including 85 minutes from child-word microphones (n=4 children) and 25 minutes from teacher-worn microphones (n=2 teachers). The overall proportion of agreement, that is, the proportion of correctly classified teacher and child utterances, was .76, with an error-corrected kappa of .50 and a weighted F1 of .76. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage Development and Disorders · Speech and dialogue systems · Speech Recognition and Synthesis
