Exploring British Accents: Modelling the Trap-Bath Split with Functional Data Analysis
Aranya Koshy, Shahin Tavakoli

TL;DR
This paper uses functional data analysis and generalized additive models to model and visualize regional accent differences in British speech, focusing on the trap-bath vowel split, with applications to classifying accents geographically.
Contribution
It introduces a novel application of functional data analysis and soap film smoothers to model and visualize regional accent variation in British speech.
Findings
Successful classification of Northern and Southern vowels
Effective visualization of geographical accent patterns
Demonstration of flexible modeling approach
Abstract
The sound of our speech is influenced by the places we come from. Great Britain contains a wide variety of distinctive accents which are of interest to linguistics. In particular, the "a" vowel in words like "class" is pronounced differently in the North and the South. Speech recordings of this vowel can be represented as formant curves or as Mel-frequency cepstral coefficient curves. Functional data analysis and generalized additive models offer techniques to model the variation in these curves. Our first aim is to model the difference between typical Northern and Southern vowels /ae/ and /a/, by training two classifiers on the North-South Class Vowels dataset collected for this paper (Koshy 2020). Our second aim is to visualize geographical variation of accents in Great Britain. For this we use speech recordings from a second dataset, the British National Corpus (BNC) audio edition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonetics and Phonology Research · Speech Recognition and Synthesis · Linguistic Variation and Morphology
