Speech Enhancement Based on Non-stationary Noise-driven Geometric Spectral Subtraction and Phase Spectrum Compensation

Md Tauhidul Islam; Udoy Saha; K.T. Shahid; Ahmed Bin Hussain; Celia Shahnaz

arXiv:1803.02870 · eess.AS · March 9, 2018 · 1 citation

TL;DR

This paper introduces a novel speech enhancement technique that adaptively tracks non-stationary noise using a geometric spectral subtraction approach combined with phase spectrum compensation, improving speech clarity in noisy environments.

Contribution

It proposes a non-stationary noise-driven geometric approach to spectral subtraction that estimates the noise from the low-frequency regions of the current noisy speech frame rather than only from the initial silence frames, and reports better speech quality than existing methods.

Findings

01. Outperforms recent speech enhancement methods in objective measures

02. Effective in reducing street and babble noise at various SNR levels

03. Improves speech intelligibility and quality in simulations

Abstract

In this paper, a speech enhancement method based on noise compensation performed on the short-time magnitude as well as phase spectra is presented. Unlike the conventional geometric approach (GA) to spectral subtraction (SS), the noise estimate to be subtracted from the noisy speech spectrum is determined by exploiting the low-frequency regions of the current frame of noisy speech rather than depending only on the initial silence frames. This gives the method the capability of tracking non-stationary noise, resulting in a non-stationary noise-driven geometric approach to spectral subtraction for speech enhancement. The noise-compensated magnitude spectrum from the GA step is then recombined with the unchanged phase of the noisy speech spectrum and used in phase compensation to obtain an enhanced complex spectrum, which is used to produce an enhanced speech frame. Extensive…
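The per-frame pipeline described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation, and several pieces are assumptions made only for illustration: the noise magnitude is taken as the mean of a few low-frequency bins of the current frame (the abstract does not say how those bins are chosen or weighted), a floored magnitude subtraction stands in for the geometric-approach gain, and the phase compensation uses the commonly described antisymmetric offset scaled by the noise estimate. The names `enhance_frame`, `low_bins`, `lam`, and `floor` are hypothetical.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) DFT, adequate for one short illustrative frame."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spec):
    """Naive inverse DFT matching dft() above."""
    n = len(spec)
    return [sum(spec[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def enhance_frame(frame, low_bins=4, lam=3.74, floor=0.05):
    """Enhance one frame; low_bins, lam and floor are illustrative assumptions."""
    n = len(frame)
    noisy = dft(frame)
    mag = [abs(v) for v in noisy]

    # Assumption: noise magnitude = mean of the lowest non-DC bins of the
    # current frame, used as a flat estimate for every bin.
    noise = sum(mag[1:1 + low_bins]) / low_bins

    # Stand-in for the geometric-approach gain: floored magnitude subtraction.
    clean_mag = [max(m - noise, floor * m) for m in mag]

    enhanced = []
    for k in range(n):
        # Recombine the compensated magnitude with the unchanged noisy phase.
        recombined = cmath.rect(clean_mag[k], cmath.phase(noisy[k]))
        # Antisymmetric weighting: +1 below Nyquist, -1 above, 0 at DC/Nyquist.
        if 0 < k < n / 2:
            psi = 1.0
        elif k > n / 2:
            psi = -1.0
        else:
            psi = 0.0
        # Phase compensation: offset the spectrum, keep only the new phase.
        compensated = recombined + lam * psi * noise
        enhanced.append(cmath.rect(clean_mag[k], cmath.phase(compensated)))

    # The compensated spectrum is no longer conjugate-symmetric, so the
    # imaginary part of the inverse transform is discarded.
    return [v.real for v in idft(enhanced)]


# Usage: a tone at bin 6 plus a weaker low-frequency component at bin 2.
frame = [math.sin(2 * math.pi * 6 * t / 32) + 0.2 * math.cos(2 * math.pi * 2 * t / 32)
         for t in range(32)]
enhanced_frame = enhance_frame(frame)
```

In a full system this function would presumably be applied to windowed, overlapping frames and the outputs overlap-added; that framing, and the choice of window, are outside what the abstract specifies.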

Taxonomy

Topics: Speech and Audio Processing · Advanced Adaptive Filtering Techniques · Blind Source Separation Techniques