Sub-Band Knowledge Distillation Framework for Speech Enhancement

Xiang Hao; Shixue Wen; Xiangdong Su; Yun Liu; Guanglai Gao; Xiaofei; Li

arXiv:2005.14435·eess.AS·October 30, 2020

Sub-Band Knowledge Distillation Framework for Speech Enhancement

Xiang Hao, Shixue Wen, Xiangdong Su, Yun Liu, Guanglai Gao, Xiaofei, Li

PDF

TL;DR

This paper introduces a sub-band knowledge distillation framework for speech enhancement that improves a general model's performance by leveraging specialized teacher models for each sub-band without increasing complexity.

Contribution

It proposes a novel sub-band knowledge distillation approach that enhances speech enhancement models by using multiple teacher models for sub-bands, outperforming full-band models.

Findings

01

Student model performance exceeds full-band models.

02

Teacher-guided training significantly improves speech enhancement.

03

Method achieves better results with fewer parameters.

Abstract

In single-channel speech enhancement, methods based on full-band spectral features have been widely studied. However, only a few methods pay attention to non-full-band spectral features. In this paper, we explore a knowledge distillation framework based on sub-band spectral mapping for single-channel speech enhancement. Specifically, we divide the full frequency band into multiple sub-bands and pre-train an elite-level sub-band enhancement model (teacher model) for each sub-band. These teacher models are dedicated to processing their own sub-bands. Next, under the teacher models' guidance, we train a general sub-band enhancement model (student model) that works for all sub-bands. Without increasing the number of model parameters and computational complexity, the student model's performance is further improved. To evaluate our proposed method, we conducted a large number of experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation