Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation
Zhangli Chen, Volker Hohmann

TL;DR
This paper introduces an online monaural speech enhancement algorithm that utilizes a novel phase-corrected filterbank and a periodicity degree feature for robust noise suppression and fundamental period detection.
Contribution
It proposes a new periodicity degree feature and a phase-corrected filterbank for improved real-time speech enhancement and pitch detection in noisy environments.
Findings
The algorithm achieves high P0 detection accuracy across different corpora.
It outperforms or matches state-of-the-art algorithms in various noise conditions.
The method is effective in non-stationary noise environments.
Abstract
This paper describes an online algorithm for enhancing monaural noisy speech. Firstly, a novel phase-corrected low-delay gammatone filterbank is derived for signal subband decomposition and resynthesis; the subband signals are then analyzed frame by frame. Secondly, a novel feature named periodicity degree (PD) is proposed to be used for detecting and estimating the fundamental period (P0) in each frame and for estimating the signal-to-noise ratio (SNR) in each frame-subband signal unit. The PD is calculated in each unit as the multiplication of the normalized autocorrelation and the comb filter ratio, and shown to be robust in various low-SNR conditions. Thirdly, the noise energy level in each signal unit is estimated recursively based on the estimated SNR for units with high PD and based on the noisy signal energy level for units with low PD. Then the a priori SNR is estimated using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Blind Source Separation Techniques
