Combination of Linear Prediction and Phase Decomposition for Glottal   Source Analysis on Voiced Speech

Yiqiao Chen; John N. Gowdy

arXiv:1612.04919·cs.SD·December 16, 2016

Combination of Linear Prediction and Phase Decomposition for Glottal Source Analysis on Voiced Speech

Yiqiao Chen, John N. Gowdy

PDF

Open Access

TL;DR

This paper introduces a novel glottal source analysis method combining linear prediction and phase decomposition, improving estimation accuracy without requiring glottal model fitting, suitable for real speech processing.

Contribution

It presents a new approach that integrates all-pole odd-order linear prediction with phase decomposition for enhanced glottal source estimation.

Findings

01

Improved separation of source and filter in glottal flow estimation

02

No need for glottal model fitting, increasing flexibility

03

Validated on real speech data showing performance gains

Abstract

Some glottal analysis approaches based upon linear prediction or complex cepstrum approaches have been proved to be effective to estimate glottal source from real speech utterances. We propose a new approach employing both an all-pole odd-order linear prediction to provide a coarse estimation and phase decomposition based causality/anti-causality separation to generate further refinements. The obtained measures show that this method improved performance in terms of reducing source-filter separation in estimation of glottal flow pulses (GFP). No glottal model fitting is required by this method, thus it has wide and flexible adaptation to retain fidelity of speakers's vocal features with computationally affordable resource. The method is evaluated on real speech utterances to validate it.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Voice and Speech Disorders