Speech Enhancement Based on Non-stationary Noise-driven Geometric Spectral Subtraction and Phase Spectrum Compensation
Md Tauhidul Islam, Udoy Saha, K.T. Shahid, Ahmed Bin Hussain, Celia, Shahnaz

TL;DR
This paper introduces a novel speech enhancement technique that adaptively tracks non-stationary noise using a geometric spectral subtraction approach combined with phase spectrum compensation, improving speech clarity in noisy environments.
Contribution
It proposes a non-stationary noise-driven geometric spectral subtraction method that utilizes low frequency regions for noise estimation, enhancing speech quality over existing methods.
Findings
Outperforms recent speech enhancement methods in objective measures
Effective in reducing street and babble noise at various SNR levels
Improves speech intelligibility and quality in simulations
Abstract
In this paper, a speech enhancement method based on noise compensation performed on short time magnitude as well phase spectra is presented. Unlike the conventional geometric approach (GA) to spectral subtraction (SS), here the noise estimate to be subtracted from the noisy speech spectrum is proposed to be determined by exploiting the low frequency regions of current frame of noisy speech rather than depending only on the initial silence frames. This approach gives the capability of tracking non-stationary noise thus resulting in a non-stationary noise-driven geometric approach of spectral subtraction for speech enhancement. The noise compensated magnitude spectrum from the GA step is then recombined with unchanged phase of noisy speech spectrum and used in phase compensation to obtain an enhanced complex spectrum, which is used to produce an enhanced speech frame. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Blind Source Separation Techniques
