# A variance modeling framework based on variational autoencoders for   speech enhancement

**Authors:** Simon Leglaive, Laurent Girin, Radu Horaud

arXiv: 1902.01605 · 2019-02-06

## TL;DR

This paper introduces a novel speech enhancement method using variational autoencoders for variance modeling, combining supervised VAE for speech with unsupervised NMF for noise, outperforming existing models.

## Contribution

It presents a new VAE-based variance modeling framework for speech enhancement that effectively integrates supervised and unsupervised learning approaches.

## Key findings

- Outperforms semi-supervised NMF baseline
- Outperforms state-of-the-art deep learning methods
- Effective in noisy environments

## Abstract

In this paper we address the problem of enhancing speech signals in noisy mixtures using a source separation approach. We explore the use of neural networks as an alternative to a popular speech variance model based on supervised non-negative matrix factorization (NMF). More precisely, we use a variational autoencoder as a speaker-independent supervised generative speech model, highlighting the conceptual similarities that this approach shares with its NMF-based counterpart. In order to be free of generalization issues regarding the noisy recording environments, we follow the approach of having a supervised model only for the target speech signal, the noise model being based on unsupervised NMF. We develop a Monte Carlo expectation-maximization algorithm for inferring the latent variables in the variational autoencoder and estimating the unsupervised model parameters. Experiments show that the proposed method outperforms a semi-supervised NMF baseline and a state-of-the-art fully supervised deep learning approach.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.01605/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1902.01605/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1902.01605/full.md

---
Source: https://tomesphere.com/paper/1902.01605