Phonetic Segmentation of the UCLA Phonetics Lab Archive

Eleanor Chodroff; Bla\v{z} Pa\v{z}on; Annie Baker; Steven Moran

arXiv:2403.19509·cs.CL·March 29, 2024·1 cites

Phonetic Segmentation of the UCLA Phonetics Lab Archive

Eleanor Chodroff, Bla\v{z} Pa\v{z}on, Annie Baker, Steven Moran

PDF

Open Access 1 Repo

TL;DR

VoxAngeles is a newly curated corpus providing detailed phonetic transcriptions, alignments, and measurements for the UCLA Phonetics Lab Archive's multilingual speech data, facilitating research in phonetics and speech technology.

Contribution

It introduces VoxAngeles, a comprehensive phonetic dataset with segmentations and measurements, enhancing the usability of the UCLA archive for linguistic and technological applications.

Findings

01

Demonstrated utility in phonetic typology research

02

Enabled analysis of vowel intrinsic f0 across languages

03

Enhanced data accessibility for low-resource speech technologies

Abstract

Research in speech technologies and comparative linguistics depends on access to diverse and accessible speech data. The UCLA Phonetics Lab Archive is one of the earliest multilingual speech corpora, with long-form audio recordings and phonetic transcriptions for 314 languages (Ladefoged et al., 2009). Recently, 95 of these languages were time-aligned with word-level phonetic transcriptions (Li et al., 2021). Here we present VoxAngeles, a corpus of audited phonetic transcriptions and phone-level alignments of the UCLA Phonetics Lab Archive, which uses the 95-language CMU re-release as our starting point. VoxAngeles also includes word- and phone-level segmentations from the original UCLA corpus, as well as phonetic measurements of word and phone durations, vowel formants, and vowel f0. This corpus enhances the usability of the original data, particularly for quantitative phonetic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pacscilab/voxangeles
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhonetics and Phonology Research