Glottal Source Estimation using an Automatic Chirp Decomposition

Thomas Drugman; Baris Bozkurt; Thierry Dutoit

arXiv:2005.07897·cs.SD·May 19, 2020

Glottal Source Estimation using an Automatic Chirp Decomposition

Thomas Drugman, Baris Bozkurt, Thierry Dutoit

PDF

Open Access

TL;DR

This paper introduces a robust method for estimating the glottal source from speech signals by extending ZZT to ZCZT, improving accuracy and robustness to GCI location errors.

Contribution

It extends the ZZT formalism by automatically determining the Z-transform contour, enhancing robustness in glottal source estimation.

Findings

01

ZCZT-based method is more robust to GCI errors

02

Automatic contour selection improves deconvolution accuracy

03

Extension of ZZT formalism to non-unit circle contours

Abstract

In a previous work, we showed that the glottal source can be estimated from speech signals by computing the Zeros of the Z-Transform (ZZT). Decomposition was achieved by separating the roots inside (causal contribution) and outside (anticausal contribution) the unit circle. In order to guarantee a correct deconvolution, time alignment on the Glottal Closure Instants (GCIs) was shown to be essential. This paper extends the formalism of ZZT by evaluating the Z-transform on a contour possibly different from the unit circle. A method is proposed for determining automatically this contour by inspecting the root distribution. The derived Zeros of the Chirp Z-Transform (ZCZT)-based technique turns out to be much more robust to GCI location errors.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Phonetics and Phonology Research