Detection of Glottal Closure Instants from Speech Signals: a Quantitative Review
Thomas Drugman, Mark Thomas, Jon Gudnason, Patrick Naylor, Thierry, Dutoit

TL;DR
This paper compares five advanced algorithms for detecting Glottal Closure Instants in speech signals, evaluating their accuracy, reliability, and robustness across various conditions and applications.
Contribution
It provides a comprehensive quantitative review of GCI detection methods, including their performance under noise and reverberation, and assesses their suitability for speech processing tasks.
Findings
SEDREAMS and YAGA outperform others in clean speech detection.
ZFR and SEDREAMS are more robust to noise and reverberation.
All methods show varying reliability depending on speech quality.
Abstract
The pseudo-periodicity of voiced speech can be exploited in several speech processing applications. This requires however that the precise locations of the Glottal Closure Instants (GCIs) are available. The focus of this paper is the evaluation of automatic methods for the detection of GCIs directly from the speech waveform. Five state-of-the-art GCI detection algorithms are compared using six different databases with contemporaneous electroglottographic recordings as ground truth, and containing many hours of speech by multiple speakers. The five techniques compared are the Hilbert Envelope-based detection (HE), the Zero Frequency Resonator-based method (ZFR), the Dynamic Programming Phase Slope Algorithm (DYPSA), the Speech Event Detection using the Residual Excitation And a Mean-based Signal (SEDREAMS) and the Yet Another GCI Algorithm (YAGA). The efficacy of these methods is first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Phonetics and Phonology Research
