Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Wen Wen, Qiang Zhou, Yu Xi, Haoyu Li, Ziqi Gong, Kai Yu

TL;DR
This paper introduces a dual-microphone speech enhancement method that combines a triple-steering spatial selection framework with a causal U-Net model, improving speech quality in high-noise, multi-speaker scenarios at low latency.
Contribution
It presents a new triple-steering spatial selection technique and a causal U-Net model that dynamically adjusts to target directions, enabling effective speech enhancement with only two microphones.
Findings
Outperforms existing methods in speech quality metrics.
Operates in real-time with minimal parameters.
Effective in extremely low SNR conditions.
Abstract
In multi-speaker scenarios, leveraging spatial features is essential for enhancing target speech. With limited microphone arrays, however, developing a compact multi-channel speech enhancement system remains challenging, especially in extremely low signal-to-noise ratio (SNR) conditions. To tackle this issue, we propose a triple-steering spatial selection method, a flexible framework that uses three steering vectors to guide enhancement and determine the enhancement range. Specifically, we introduce a causal-directed U-Net (CDUNet) model, which takes raw multi-channel speech and the desired enhancement width as inputs. This enables dynamic adjustment of the steering vectors based on the target direction and fine-tuning of the enhancement region according to the angular separation between the target and interference signals. Our model, with only a dual microphone array, excels in both speech…
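To make the idea of three steering vectors spanning an enhancement region more concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a far-field plane-wave model, a two-microphone linear array with a hypothetical 5 cm spacing, angles measured from broadside, and that the three vectors point at the target direction and at the two edges of the requested width. None of these details are confirmed by the abstract, and the function names are illustrative only.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def steering_vector(angle_deg, freq_hz, mic_spacing_m=0.05):
    """Far-field steering vector for a 2-mic linear array (assumed geometry).

    The angle is measured from broadside; microphone 0 is the phase reference.
    """
    # Time difference of arrival between the two microphones.
    delay = mic_spacing_m * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND
    return [1 + 0j, cmath.exp(-2j * math.pi * freq_hz * delay)]


def triple_steering(target_deg, width_deg, freq_hz, mic_spacing_m=0.05):
    """Steering vectors for the left edge, centre, and right edge of the region.

    This edge/centre/edge layout is a hypothetical reading of "triple steering".
    """
    half = width_deg / 2.0
    angles = [target_deg - half, target_deg, target_deg + half]
    return [steering_vector(a, freq_hz, mic_spacing_m) for a in angles]


# Usage: a 30-degree-wide region centred on broadside, evaluated at 1 kHz.
vectors = triple_steering(target_deg=0.0, width_deg=30.0, freq_hz=1000.0)
```

In this sketch, narrowing `width_deg` pulls the two edge vectors toward the target vector, which is one plausible way the enhancement region could be tightened when an interferer sits close to the target.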
Taxonomy
Topics: Speech and Audio Processing · Speech Recognition and Synthesis · Advanced Adaptive Filtering Techniques
Methods: Convolution · Max Pooling · Concatenated Skip Connection · U-Net
