PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement
Zizhen Lin, Junyu Wang, Ruili Li, Fei Shen, Xi Xuan

TL;DR
PrimeK-Net introduces a multi-scale spectral learning approach using group prime-kernel convolutions, achieving state-of-the-art speech enhancement performance with high efficiency and fewer parameters.
Contribution
The paper presents PrimeK-Net, a novel CNN-based framework with group prime-kernel convolutions and efficient modules for improved speech enhancement.
Findings
Achieves SOTA PESQ score of 3.61 on VoiceBank+Demand dataset.
Uses only 1.41 million parameters, demonstrating efficiency.
Outperforms existing methods in spectral feature extraction and computational cost.
Abstract
Single-channel speech enhancement is a challenging ill-posed problem focused on estimating clean speech from degraded signals. Existing studies have demonstrated the competitive performance of combining convolutional neural networks (CNNs) with Transformers in speech enhancement tasks. However, existing frameworks have not sufficiently addressed computational efficiency and have overlooked the natural multi-scale distribution of the spectrum. Additionally, the potential of CNNs in speech enhancement has yet to be fully realized. To address these issues, this study proposes a Deep Separable Dilated Dense Block (DSDDB) and a Group Prime Kernel Feedforward Channel Attention (GPFCA) module. Specifically, the DSDDB introduces higher parameter and computational efficiency to the Encoder/Decoder of existing frameworks. The GPFCA module replaces the position of the Conformer, extracting deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Adaptive Filtering Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Attention Is All You Need · Concatenated Skip Connection · Batch Normalization · Convolution · Dense Connections · Feedforward Network · Dense Block
