Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition
Junhao Xu, Jianwei Yu, Xunying Liu, Helen Meng

TL;DR
This paper introduces novel mixed precision quantization methods for DNNs in overlapped speech separation, automatically optimizing bit-widths for different model components to improve performance and reduce model size.
Contribution
The paper proposes three innovative techniques for automatically learning mixed precision settings in DNNs for speech separation, addressing limitations of uniform quantization methods.
Findings
Mixed precision quantization outperforms uniform precision baselines.
Significant WER reduction of up to 2.88% absolute.
Improved SI-SNR and PESQ scores with lower bit-width models.
Abstract
Recognition of overlapped speech has been a highly challenging task to date. State-of-the-art multi-channel speech separation system are becoming increasingly complex and expensive for practical applications. To this end, low-bit neural network quantization provides a powerful solution to dramatically reduce their model size. However, current quantization methods are based on uniform precision and fail to account for the varying performance sensitivity at different model components to quantization errors. In this paper, novel mixed precision DNN quantization methods are proposed by applying locally variable bit-widths to individual TCN components of a TF masking based multi-channel speech separation system. The optimal local precision settings are automatically learned using three techniques. The first two approaches utilize quantization sensitivity metrics based on either the mean…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing
