DAT-CFTNet: Speech Enhancement for Cochlear Implant Recipients using Attention-based Dual-Path Recurrent Neural Network
Nursadul Mamun, John H.L. Hansen

TL;DR
This paper introduces DAT-CFTNet, a novel attention-based dual-path RNN integrated with CFTNet, significantly improving speech enhancement for cochlear implant users by effectively suppressing noise and preserving speech quality.
Contribution
It proposes a new attention mechanism within a dual-path RNN framework combined with CFTNet, enhancing speech intelligibility and quality for cochlear implant recipients in noisy environments.
Findings
Outperforms existing models like CFTNet and DCCRN in speech intelligibility and quality.
Effectively suppresses non-stationary noise and avoids musical artifacts.
Shows superior performance in cochlear implant speech enhancement studies.
Abstract
The human auditory system has the ability to selectively focus on key speech elements in an audio stream while giving secondary attention to less relevant areas such as noise or distortion within the background, dynamically adjusting its attention over time. Inspired by the recent success of attention models, this study introduces a dual-path attention module in the bottleneck layer of a concurrent speech enhancement network. Our study proposes an attention-based dual-path RNN (DAT-RNN), which, when combined with the modified complex-valued frequency transformation network (CFTNet), forms the DAT-CFTNet. This attention mechanism allows for precise differentiation between speech and noise in time-frequency (T-F) regions of spectrograms, optimizing both local and global context information processing in the CFTNet. Our experiments suggest that the DAT-CFTNet leads to consistently improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
