Using phonetic constraints in acoustic-to-articulatory inversion

Blaise Potard (INRIA Lorraine - LORIA); Yves Laprie (INRIA Lorraine -; LORIA)

arXiv:cs/0511076·cs.CL·May 23, 2007

Using phonetic constraints in acoustic-to-articulatory inversion

Blaise Potard (INRIA Lorraine - LORIA), Yves Laprie (INRIA Lorraine -, LORIA)

PDF

Open Access

TL;DR

This paper proposes a method for acoustic-to-articulatory inversion that incorporates phonetic constraints based on French vowel formants to improve the realism of the reconstructed articulatory movements.

Contribution

It introduces a novel approach using phonetic constraints derived from formant frequencies to enhance inversion accuracy and realism.

Findings

01

Phonetic constraints improve inversion results.

02

Articulatory parameters align better with X-ray data.

03

Method effectively distinguishes vowel transitions.

Abstract

The goal of this work is to recover articulatory information from the speech signal by acoustic-to-articulatory inversion. One of the main difficulties with inversion is that the problem is underdetermined and inversion methods generally offer no guarantee on the phonetical realism of the inverse solutions. A way to adress this issue is to use additional phonetic constraints. Knowledge of the phonetic caracteristics of French vowels enable the derivation of reasonable articulatory domains in the space of Maeda parameters: given the formants frequencies (F1,F2,F3) of a speech sample, and thus the vowel identity, an "ideal" articulatory domain can be derived. The space of formants frequencies is partitioned into vowels, using either speaker-specific data or generic information on formants. Then, to each articulatory vector can be associated a phonetic score varying with the distance to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhonetics and Phonology Research · Speech Recognition and Synthesis