Statistics in Phonetics

Shahin Tavakoli; Beatrice Matteo; Davide Pigoli; Eleanor Chodroff,; John Coleman; Michele Gubian; Margaret E. L. Renwick; Morgan Sonderegger

arXiv:2404.07567·stat.AP·June 12, 2024·1 cites

Statistics in Phonetics

Shahin Tavakoli, Beatrice Matteo, Davide Pigoli, Eleanor Chodroff,, John Coleman, Michele Gubian, Margaret E. L. Renwick, Morgan Sonderegger

PDF

Open Access

TL;DR

This paper introduces statistical and signal processing methods used in phonetics, highlighting recent shifts towards advanced modeling techniques driven by large speech datasets and machine learning applications.

Contribution

It provides a comprehensive overview of statistical methods in phonetics, emphasizing recent advances like Bayesian modeling and curve data analysis.

Findings

01

Shift from fixed to random effects models

02

Use of Bayesian and curve data methods

03

Impact of machine learning on phonetic analysis

Abstract

Phonetics is the scientific field concerned with the study of how speech is produced, heard and perceived. It abounds with data, such as acoustic speech recordings, neuroimaging data, or articulatory data. In this paper, we provide an introduction to different areas of phonetics (acoustic phonetics, sociophonetics, speech perception, articulatory phonetics, speech inversion, sound change, and speech technology), an overview of the statistical methods for analyzing their data, and an introduction to the signal processing methods commonly applied to speech recordings. A major transition in the statistical modeling of phonetic data has been the shift from fixed effects to random effects regression models, the modeling of curve data (for instance via GAMMs or FDA methods), and the use of Bayesian methods. This shift has been driven in part by the increased focus on large speech corpora in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing