Voiced speech as secondary response of a self-consistent fundamental drive
Friedhelm R. Drepper

TL;DR
This paper proposes a self-consistent decomposition method for voiced speech segments, reconstructing a fundamental drive that aligns with acoustic modes and enhances understanding of pitch perception and speech excitation.
Contribution
It introduces a novel self-consistent part-tone decomposition approach that models the fundamental drive as a topologically equivalent glottal oscillator, improving speech analysis.
Findings
Reconstruction of a band-limited fundamental drive from speech signals.
Confirmation of topological equivalence between part-tones and acoustic excitation modes.
Enhanced modeling of pitch perception and speech excitation dynamics.
Abstract
Voiced segments of speech are assumed to be composed of non-stationary acoustic objects which can be described as stationary response of a non-stationary fundamental drive (FD) process and which are furthermore suited to reconstruct the hidden FD by using a voice adapted (self-consistent) part-tone decomposition of the speech signal. The universality and robustness of human pitch perception encourages the reconstruction of a band-limited FD in the frequency range of the pitch. The self-consistent decomposition of voiced continuants generates several part-tones which can be confirmed to be topologically equivalent to corresponding acoustic modes of the excitation on the transmitter side. As topologically equivalent image of a glottal master oscillator, the self-consistent FD is suited to serve as low frequency part of the basic time-scale separation of auditive perception and to describe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Neuroscience and Music Perception
