SepPrune: Structured Pruning for Efficient Deep Speech Separation

Yuqi Li; Kai Li; Xin Yin; Zhifei Yang; Junhao Dong; Zeyu Dong; Chuanguang Yang; Yingli Tian; Yao Lu

arXiv:2505.12079·cs.SD·May 20, 2025

SepPrune: Structured Pruning for Efficient Deep Speech Separation

Yuqi Li, Kai Li, Xin Yin, Zhifei Yang, Junhao Dong, Zeyu Dong, Chuanguang Yang, Yingli Tian, Yao Lu

PDF

Open Access 1 Repo

TL;DR

SepPrune introduces a structured pruning framework tailored for deep speech separation models, significantly reducing computational costs while maintaining high separation performance, enabling faster training and real-time application suitability.

Contribution

It is the first structured pruning method specifically designed for deep speech separation models, combining differentiable masking with channel pruning for efficiency.

Findings

01

Pruned models recover 85% of original performance with minimal fine-tuning.

02

Achieves 36× faster convergence compared to training from scratch.

03

Outperforms existing pruning methods in speech separation tasks.

Abstract

Although deep learning has substantially advanced speech separation in recent years, most existing studies continue to prioritize separation quality while overlooking computational efficiency, an essential factor for low-latency speech processing in real-time applications. In this paper, we propose SepPrune, the first structured pruning framework specifically designed to compress deep speech separation models and reduce their computational cost. SepPrune begins by analyzing the computational structure of a given model to identify layers with the highest computational burden. It then introduces a differentiable masking strategy to enable gradient-driven channel selection. Based on the learned masks, SepPrune prunes redundant channels and fine-tunes the remaining parameters to recover performance. Extensive experiments demonstrate that this learnable pruning paradigm yields substantial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

itsnotacie/sepprune
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Voice and Speech Disorders

MethodsPruning