Distributed Microphone Speech Enhancement based on Deep Learning

Syu-Siang Wang; Yu-You Liang; Jeih-weih Hung; Yu Tsao; Hsin-Min Wang,; Shih-Hau Fang

arXiv:1911.08153·eess.AS·May 26, 2020·5 cites

Distributed Microphone Speech Enhancement based on Deep Learning

Syu-Siang Wang, Yu-You Liang, Jeih-weih Hung, Yu Tsao, Hsin-Min Wang,, Shih-Hau Fang

PDF

Open Access

TL;DR

This paper explores three deep neural network-based speech enhancement systems in distributed microphone setups, demonstrating improved speech quality and intelligibility in noisy environments, with the channel-dependent fusion system performing best.

Contribution

It introduces and compares three novel DNN-based speech enhancement architectures for distributed microphone arrays, highlighting the effectiveness of channel-dependent fusion.

Findings

01

All three DNN systems improve speech quality and intelligibility.

02

The channel-dependent DNN fusion system achieves the highest SNR improvement.

03

Experimental results validate the effectiveness of the proposed methods.

Abstract

Speech-related applications deliver inferior performance in complex noise environments. Therefore, this study primarily addresses this problem by introducing speech-enhancement (SE) systems based on deep neural networks (DNNs) applied to a distributed microphone architecture, and then investigates the effectiveness of three different DNN-model structures. The first system constructs a DNN model for each microphone to enhance the recorded noisy speech signal, and the second system combines all the noisy recordings into a large feature structure that is then enhanced through a DNN model. As for the third system, a channel-dependent DNN is first used to enhance the corresponding noisy input, and all the channel-wise enhanced outputs are fed into a DNN fusion model to construct a nearly clean signal. All the three DNN SE systems are operated in the acoustic frequency domain of speech…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Advanced Adaptive Filtering Techniques

MethodsTest