Speech Enhancement In Multiple-Noise Conditions using Deep Neural   Networks

Anurag Kumar; Dinei Florencio

arXiv:1605.02427·cs.SD·May 10, 2016

Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks

Anurag Kumar, Dinei Florencio

PDF

2 Repos

TL;DR

This paper addresses speech enhancement in complex real-world environments with multiple simultaneous noises by proposing DNN-based strategies, including psychoacoustic model-based training, to improve speech quality.

Contribution

It introduces novel DNN training strategies tailored for multi-noise conditions and incorporates psychoacoustic models for enhanced speech restoration.

Findings

01

DNN strategies significantly improve speech clarity in multi-noise environments

02

Psychoacoustic model-based training enhances noise suppression effectiveness

03

Approaches outperform traditional single-noise enhancement methods

Abstract

In this paper we consider the problem of speech enhancement in real-world like conditions where multiple noises can simultaneously corrupt speech. Most of the current literature on speech enhancement focus primarily on presence of single noise in corrupted speech which is far from real-world environments. Specifically, we deal with improving speech quality in office environment where multiple stationary as well as non-stationary noises can be simultaneously present in speech. We propose several strategies based on Deep Neural Networks (DNN) for speech enhancement in these scenarios. We also investigate a DNN training strategy based on psychoacoustic models from speech coding for enhancement of noisy speech

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.