Phonetic Segmentation of the UCLA Phonetics Lab Archive
Eleanor Chodroff, Blaž Pažon, Annie Baker, Steven Moran

TL;DR
VoxAngeles is a newly curated corpus providing detailed phonetic transcriptions, alignments, and measurements for the UCLA Phonetics Lab Archive's multilingual speech data, facilitating research in phonetics and speech technology.
Contribution
It introduces VoxAngeles, a comprehensive phonetic dataset with segmentations and measurements, enhancing the usability of the UCLA archive for linguistic and technological applications.
Findings
Demonstrated utility in phonetic typology research
Enabled analysis of vowel intrinsic f0 across languages
Enhanced data accessibility for low-resource speech technologies
Abstract
Research in speech technologies and comparative linguistics depends on access to diverse and accessible speech data. The UCLA Phonetics Lab Archive is one of the earliest multilingual speech corpora, with long-form audio recordings and phonetic transcriptions for 314 languages (Ladefoged et al., 2009). Recently, 95 of these languages were time-aligned with word-level phonetic transcriptions (Li et al., 2021). Here we present VoxAngeles, a corpus of audited phonetic transcriptions and phone-level alignments of the UCLA Phonetics Lab Archive, which uses the 95-language CMU re-release as our starting point. VoxAngeles also includes word- and phone-level segmentations from the original UCLA corpus, as well as phonetic measurements of word and phone durations, vowel formants, and vowel f0. This corpus enhances the usability of the original data, particularly for quantitative phonetic…
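As a rough illustration of what a phone-level alignment makes possible, the sketch below derives per-phone durations from a list of aligned intervals. The interval layout (label, start, end), the toy values, and the function names `phone_durations` and `mean_durations` are assumptions made for this example; they are not the corpus's actual file format or tooling.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical phone-level alignment for one word:
# (phone label, start time in seconds, end time in seconds).
# These values are invented for illustration, not taken from the corpus.
alignment = [
    ("k", 0.00, 0.08),
    ("a", 0.08, 0.21),
    ("t", 0.21, 0.29),
    ("a", 0.29, 0.45),
]


def phone_durations(intervals):
    """Group interval durations (in seconds) by phone label."""
    durations = defaultdict(list)
    for label, start, end in intervals:
        if end <= start:
            raise ValueError(f"non-positive duration for {label!r}")
        durations[label].append(end - start)
    return dict(durations)


def mean_durations(intervals):
    """Return the mean duration (in seconds) of each phone label."""
    return {
        label: mean(values)
        for label, values in phone_durations(intervals).items()
    }


if __name__ == "__main__":
    for label, duration in mean_durations(alignment).items():
        print(f"{label}\t{duration:.3f} s")
```

The same grouping step would apply to word-level intervals; vowel formant and f0 measurements additionally require the audio signal and are not covered by this sketch.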
Taxonomy
Topics: Phonetics and Phonology Research
