Blind Mask to Improve Intelligibility of Non-Stationary Noisy Speech
F. Farias, R. Coelho

TL;DR
This paper introduces a blind acoustic mask that adaptively detects noise in non-stationary noisy speech, improving intelligibility without prior knowledge of noise or speech statistics, and performs comparably to ideal masks.
Contribution
The proposed blind acoustic mask uses noise statistics to adaptively select speech segments, enhancing intelligibility without requiring prior noise or speech information.
Findings
Achieves intelligibility gains comparable to ideal masks
Maintains good speech quality in non-stationary noise conditions
Effective across various noise types and SNR levels
Abstract
This letter proposes a novel blind acoustic mask (BAM) designed to adaptively detect noise components and preserve target speech segments in time-domain. A robust standard deviation estimator is applied to the non-stationary noisy speech to identify noise masking elements. The main contribution of the proposed solution is the use of this noise statistics to derive an adaptive information to define and select samples with lower noise proportion. Thus, preserving speech intelligibility. Additionally, no information of the target speech and noise signals statistics is previously required to this non-ideal mask. The BAM and three competitive methods, Ideal Binary Mask (IBM), Target Binary Mask (TBM), and Non-stationary Noise Estimation for Speech Enhancement (NNESE), are evaluated considering speech signals corrupted by three non-stationary acoustic noises and six values of signal-to-noise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBottleneck Attention Module
