Learning-based A Posteriori Speech Presence Probability Estimation and Applications
Shuai Tao, Jesper Rindom Jensen, Yang Xiang, Himavanth Reddy,, Qingzheng Zhang, Mads Gr{\ae}sb{\o}ll Christensen

TL;DR
This paper introduces a deep learning-based a posteriori speech presence probability estimator that improves noise PSD estimation and speech enhancement, especially in non-stationary noise, with low model complexity.
Contribution
It proposes a hybrid global-local deep neural network architecture for more accurate SPP estimation in challenging noise conditions.
Findings
Achieves higher noise PSD estimation accuracy.
Enhances speech quality in non-stationary noise environments.
Requires low model complexity for real-time applications.
Abstract
The a posteriori speech presence probability (SPP) is the fundamental component of noise power spectral density (PSD) estimation, which can contribute to speech enhancement and speech recognition systems. Most existing SPP estimators can estimate SPP accurately from the background noise. Nevertheless, numerous challenges persist, including the difficulty of accurately estimating SPP from non-stationary noise with statistics-based methods and the high latency associated with deep learning-based approaches. This paper presents an improved SPP estimation approach based on deep learning to achieve higher SPP estimation accuracy, especially in non-stationary noise conditions. To promote the information extraction performance of the DNN, the global information of the observed signal and the local information of the decoupled frequency bins from the observed signal are connected as hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Speech and dialogue systems
