# Speech Enhancement with Wide Residual Networks in Reverberant   Environments

**Authors:** Jorge Llombart, Dayana Ribas, Antonio Miguel, Luis Vicente, Alfonso, Ortega, Eduardo Lleida

arXiv: 1904.05167 · 2019-04-11

## TL;DR

This paper introduces a speech enhancement method using Wide Residual Networks with residual connections and 1D convolutions, outperforming state-of-the-art techniques in reverberant environments and generalizing well to real-world data.

## Contribution

The paper presents a novel residual network architecture for speech enhancement that effectively processes reverberant speech and outperforms existing methods like WPE.

## Key findings

- Outperforms WPE in objective quality metrics
- Generalizes from synthetic to real reverberant speech
- Effective across various room conditions and datasets

## Abstract

This paper proposes a speech enhancement method which exploits the high potential of residual connections in a Wide Residual Network architecture. This is supported on single dimensional convolutions computed alongside the time domain, which is a powerful approach to process contextually correlated representations through the temporal domain, such as speech feature sequences. We find the residual mechanism extremely useful for the enhancement task since the signal always has a linear shortcut and the non-linear path enhances it in several steps by adding or subtracting corrections. The enhancement capability of the proposal is assessed by objective quality metrics evaluated with simulated and real samples of reverberated speech signals. Results show that the proposal outperforms the state-of-the-art method called WPE, which is known to effectively reduce reverberation and greatly enhance the signal. The proposed model, trained with artificial synthesized reverberation data, was able to generalize to real room impulse responses for a variety of conditions (e.g. different room sizes, $RT_{60}$, near & far field). Furthermore, it achieves accuracy for real speech with reverberation from two different datasets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.05167/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1904.05167/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1904.05167/full.md

---
Source: https://tomesphere.com/paper/1904.05167