Semi-supervised Speech Enhancement in Envelop and Details Subspaces

Pengfei Sun; Jun Qin

arXiv:1609.09443·cs.SD·February 24, 2017

Semi-supervised Speech Enhancement in Envelop and Details Subspaces

Pengfei Sun, Jun Qin

PDF

Open Access

TL;DR

This paper introduces a novel speech enhancement framework that decouples noisy speech spectrograms into envelope and detail subspaces, employing low-rank and sparse decomposition techniques to improve speech intelligibility and quality.

Contribution

It presents a new modulation decoupling subspace approach with supervised decomposition schemes and Bayesian learning for robust speech enhancement, outperforming existing algorithms.

Findings

01

Improved speech intelligibility and perceptual quality.

02

Outperforms four existing speech enhancement algorithms.

03

Effective noise reduction in both envelope and detail subspaces.

Abstract

In this study, we propose a modulation decoupling based single channel speech enhancement subspace framework, in which the spectrogram of noisy speech is decoupled as the product of a spectral envelop subspace and a spectral details subspace. This decoupling approach provides a method to specifically work on elimination of those noises that greatly affect the intelligibility. Two supervised low-rank and sparse decomposition schemes are developed in the spectral envelop subspace to obtain a robust recovery of speech components. A Bayesian formulation of non-negative factorization is used to learn the speech dictionary from the spectral envelop subspace of clean speech samples. In the spectral details subspace, a standard robust principal component analysis is implemented to extract the speech components. The validation results show that compared with four speech enhancement algorithms,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Blind Source Separation Techniques