Voiced speech as secondary response of a self-consistent fundamental   drive

Friedhelm R. Drepper

arXiv:cs/0612107·cs.SD·May 23, 2007

Voiced speech as secondary response of a self-consistent fundamental drive

Friedhelm R. Drepper

PDF

Open Access

TL;DR

This paper proposes a self-consistent decomposition method for voiced speech segments, reconstructing a fundamental drive that aligns with acoustic modes and enhances understanding of pitch perception and speech excitation.

Contribution

It introduces a novel self-consistent part-tone decomposition approach that models the fundamental drive as a topologically equivalent glottal oscillator, improving speech analysis.

Findings

01

Reconstruction of a band-limited fundamental drive from speech signals.

02

Confirmation of topological equivalence between part-tones and acoustic excitation modes.

03

Enhanced modeling of pitch perception and speech excitation dynamics.

Abstract

Voiced segments of speech are assumed to be composed of non-stationary acoustic objects which can be described as stationary response of a non-stationary fundamental drive (FD) process and which are furthermore suited to reconstruct the hidden FD by using a voice adapted (self-consistent) part-tone decomposition of the speech signal. The universality and robustness of human pitch perception encourages the reconstruction of a band-limited FD in the frequency range of the pitch. The self-consistent decomposition of voiced continuants generates several part-tones which can be confirmed to be topologically equivalent to corresponding acoustic modes of the excitation on the transmitter side. As topologically equivalent image of a glottal master oscillator, the self-consistent FD is suited to serve as low frequency part of the basic time-scale separation of auditive perception and to describe…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Neuroscience and Music Perception