FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time   Single-Channel Speech Enhancement

Xiang Hao; Xiangdong Su; Radu Horaud; Xiaofei Li

arXiv:2010.15508·eess.AS·July 4, 2024

FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement

Xiang Hao, Xiangdong Su, Radu Horaud, Xiaofei Li

PDF

5 Repos

TL;DR

This paper introduces FullSubNet, a real-time speech enhancement model that combines full-band and sub-band processing to leverage their complementary strengths, achieving superior performance on the DNS challenge dataset.

Contribution

The paper presents a novel sequential fusion of full-band and sub-band models with joint training for improved speech enhancement.

Findings

01

FullSubNet outperforms top methods in DNS Challenge

02

Full-band and sub-band features are complementary

03

The model effectively captures both global and local spectral information

Abstract

This paper proposes a full-band and sub-band fusion model, named as FullSubNet, for single-channel real-time speech enhancement. Full-band and sub-band refer to the models that input full-band and sub-band noisy spectral feature, output full-band and sub-band speech target, respectively. The sub-band model processes each frequency independently. Its input consists of one frequency and several context frequencies. The output is the prediction of the clean speech target for the corresponding frequency. These two types of models have distinct characteristics. The full-band model can capture the global spectral context and the long-distance cross-band dependencies. However, it lacks the ability to modeling signal stationarity and attending the local spectral pattern. The sub-band model is just the opposite. In our proposed FullSubNet, we connect a pure full-band model and a pure sub-band…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.