Listening to Sounds of Silence for Speech Denoising

Ruilin Xu; Rundi Wu; Yuko Ishiwaka; Carl Vondrick; Changxi Zheng

arXiv:2010.12013·cs.SD·October 26, 2020·23 cites

Listening to Sounds of Silence for Speech Denoising

Ruilin Xu, Rundi Wu, Yuko Ishiwaka, Carl Vondrick, Changxi Zheng

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a deep learning approach for speech denoising that exploits incidental silent intervals in speech signals to effectively learn noise characteristics and improve denoising performance across languages.

Contribution

The study introduces a novel speech denoising method that leverages silent intervals to learn noise dynamics, outperforming existing methods and demonstrating strong generalization.

Findings

01

Outperforms several state-of-the-art denoising methods

02

Effectively generalizes to unseen spoken languages

03

Utilizes silent intervals to learn noise features

Abstract

We introduce a deep learning model for speech denoising, a long-standing challenge in audio analysis arising in numerous applications. Our approach is based on a key observation about human speech: there is often a short pause between each sentence or word. In a recorded speech signal, those pauses introduce a series of time periods during which only noise is present. We leverage these incidental silent intervals to learn a model for automatic speech denoising given only mono-channel audio. Detected silent intervals over time expose not just pure noise but its time-varying features, allowing the model to learn noise dynamics and suppress it from the speech signal. Experiments on multiple datasets confirm the pivotal role of silent interval detection for speech denoising, and our method outperforms several state-of-the-art denoising methods, including those that accept only audio input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

henryxrl/Listening-to-Sound-of-Silence-for-Speech-Denoising
pytorchOfficial

Videos

Listening to Sounds of Silence for Speech Denoising· slideslive

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis