Performance optimizations on deep noise suppression models
Jerry Chee, Sebastian Braun, Vishak Gopal, Ross Cutler

TL;DR
This paper explores magnitude structured pruning and network re-parameterization to significantly speed up deep noise suppression models with minimal quality loss, enabling real-time audio processing.
Contribution
It introduces a method combining pruning and re-parameterization to optimize DNS models for faster inference without substantial quality degradation.
Findings
Achieved up to 7.25X inference speedup
Re-parameterization is the main factor for speedup
Pruning performs comparably to training smaller models
Abstract
We study the role of magnitude structured pruning as an architecture search to speed up the inference time of a deep noise suppression (DNS) model. While deep learning approaches have been remarkably successful in enhancing audio quality, their increased complexity inhibits their deployment in real-time applications. We achieve up to a 7.25X inference speedup over the baseline, with a smooth model performance degradation. Ablation studies indicate that our proposed network re-parameterization (i.e., size per layer) is the major driver of the speedup, and that magnitude structured pruning does comparably to directly training a model in the smaller size. We report inference speed because a parameter reduction does not necessitate speedup, and we measure model quality using an accurate non-intrusive objective speech quality metric.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Acoustic Wave Phenomena Research
MethodsPruning
