Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising

Hao Li; Xueliang Zhang; Hui Zhang; Guanglai Gao

arXiv:1708.08251·cs.SD·August 29, 2017·2 cites

Open Access

TL;DR

This paper proposes an integrated speech enhancement method combining WPE and DNN to improve dereverberation and denoising, achieving faster processing and better speech quality.

Contribution

It introduces a novel integration of a DNN with WPE that handles the influence of additive noise and removes the need for iterative processing, enhancing efficiency and effectiveness.

Findings

01

Significant improvement in speech quality.

02

Faster processing compared to traditional WPE.

03

Effective noise suppression and dereverberation.

Abstract

Both reverberation and additive noise degrade speech quality and intelligibility. The weighted prediction error (WPE) method performs well on dereverberation but has two limitations. First, WPE does not account for additive noise, which degrades its dereverberation performance. Second, it relies on a time-consuming iterative process, and there is neither a guarantee of convergence nor a widely accepted convergence criterion. In this paper, we integrate a deep neural network (DNN) into WPE for dereverberation and denoising. The DNN suppresses the background noise so that the noise-free assumption of WPE is met. The DNN is also used to directly predict the spectral variance of the target speech, which lets WPE run without iteration. Experimental results show that the proposed method significantly improves speech quality and runs fast.
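To make the "WPE without iteration" idea concrete, here is a minimal sketch of a single WPE filtering pass for one channel and one frequency bin, assuming the spectral variance of the target speech is already available. In the paper that variance comes from the DNN; here it is simply an input array. The function name `wpe_single_pass`, the `delay` and `taps` parameters, and the synthetic test signal are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def wpe_single_pass(y, lam, delay=2, taps=3, eps=1e-8):
    """One non-iterative WPE pass (hypothetical helper, single channel, one bin).

    y   : complex STFT coefficients over time, shape (T,)
    lam : spectral variance of the target speech per frame, shape (T,)
          (assumed given; the paper predicts it with a DNN)
    """
    T = y.shape[0]
    # Row t holds the delayed past frames [y[t-delay], ..., y[t-delay-taps+1]],
    # zero-padded where the index would be negative.
    Y = np.zeros((T, taps), dtype=complex)
    for k in range(taps):
        shift = delay + k
        Y[shift:, k] = y[:T - shift]
    w = 1.0 / np.maximum(lam, eps)      # per-frame weights 1 / variance
    R = (Y.T * w) @ Y.conj()            # variance-weighted correlation matrix
    r = (Y.T * w) @ y.conj()            # variance-weighted correlation vector
    g = np.linalg.solve(R, r)           # prediction filter
    return y - Y @ g.conj()             # subtract predicted late reverberation


# Synthetic check: a source with a slowly varying envelope plus a recursive tail.
rng = np.random.default_rng(0)
T, delay, coeffs = 2000, 2, [0.4, 0.2, 0.1]
env = 1.0 + 0.5 * np.sin(2 * np.pi * np.arange(T) / 50)
s = env * (rng.standard_normal(T) + 1j * rng.standard_normal(T)) / np.sqrt(2)
y = np.zeros(T, dtype=complex)
for t in range(T):
    acc = s[t]
    for i, c in enumerate(coeffs):
        if t - delay - i >= 0:
            acc += c * y[t - delay - i]
    y[t] = acc

# env**2 stands in for the variance that the DNN would predict.
x = wpe_single_pass(y, env ** 2, delay=delay, taps=len(coeffs))
err_before = np.mean(np.abs(y - s) ** 2)
err_after = np.mean(np.abs(x - s) ** 2)
print(err_before, err_after)
```

Because the variance is supplied up front, the filter is obtained from a single linear solve. Conventional WPE would instead alternate between re-estimating the variance from the current output and re-solving for the filter.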

Taxonomy

Topics: Speech and Audio Processing · Infant Health and Development · Speech Recognition and Synthesis