# Progressive Speech Enhancement with Residual Connections

**Authors:** Jorge Llombart, Dayana Ribas, Antonio Miguel, Luis Vicente, Alfonso, Ortega, Eduardo Lleida

arXiv: 1904.04511 · 2019-04-10

## TL;DR

This paper introduces a progressive speech enhancement method using residual connections that improves performance by visually inspecting and refining the enhancement process, achieving state-of-the-art results.

## Contribution

It proposes a novel residual network architecture with progressive supervision for speech enhancement, providing insights that lead to performance improvements.

## Key findings

- Achieves state-of-the-art speech enhancement results.
- Balances dereverberation and spectral distortion effectively.
- Provides interpretability of the enhancement process.

## Abstract

This paper studies the Speech Enhancement based on Deep Neural Networks. The proposed architecture gradually follows the signal transformation during enhancement by means of a visualization probe at each network block. Alongside the process, the enhancement performance is visually inspected and evaluated in terms of regression cost. This progressive scheme is based on Residual Networks. During the process, we investigate a residual connection with a constant number of channels, including internal state between blocks, and adding progressive supervision. The insights provided by the interpretation of the network enhancement process leads us to design an improved architecture for the enhancement purpose. Following this strategy, we are able to obtain speech enhancement results beyond the state-of-the-art, achieving a favorable trade-off between dereverberation and the amount of spectral distortion.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.04511/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1904.04511/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1904.04511/full.md

---
Source: https://tomesphere.com/paper/1904.04511