Acoustic characterization of speech rhythm: going beyond metrics with recurrent neural networks
Fran\c{c}ois Deloche, Laurent Bonnasse-Gahot, Judit Gervain

TL;DR
This study uses recurrent neural networks to analyze speech rhythm, revealing complex acoustic patterns that align with linguistic rhythmic typologies and surpass traditional metrics in capturing rhythm complexity.
Contribution
It demonstrates that deep learning models can effectively characterize speech rhythm and relate neural representations to established rhythmic metrics, advancing the understanding of speech prosody.
Findings
Neural network identified language with 40% accuracy on 10-second recordings.
Representations reflect speech rhythm typologies beyond simple clusters.
Correlations found between network activations and known rhythm metrics.
Abstract
Languages have long been described according to their perceived rhythmic attributes. The associated typologies are of interest in psycholinguistics as they partly predict newborns' abilities to discriminate between languages and provide insights into how adult listeners process non-native languages. Despite the relative success of rhythm metrics in supporting the existence of linguistic rhythmic classes, quantitative studies have yet to capture the full complexity of temporal regularities associated with speech rhythm. We argue that deep learning offers a powerful pattern-recognition approach to advance the characterization of the acoustic bases of speech rhythm. To explore this hypothesis, we trained a medium-sized recurrent neural network on a language identification task over a large database of speech recordings in 21 languages. The network had access to the amplitude envelopes and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonetics and Phonology Research · Language and cultural evolution · Language Development and Disorders
