Explainable DNN-based Beamformer with Postfilter
Adi Cohen, Daniel Wong, Jung-Suk Lee, Sharon Gannot

TL;DR
This paper presents an explainable deep neural network-based beamformer with a postfilter that combines spatial filtering and attention mechanisms, providing improved speech enhancement with better interpretability and no need for prior speaker activity knowledge.
Contribution
It introduces a novel two-stage beamforming approach with an integrated attention mechanism and spatial analysis, advancing explainability and performance in multichannel speech processing.
Findings
Superior speech enhancement results without prior speaker activity knowledge
Effective integration of an attention mechanism improves noise suppression
Thorough spatial analysis enhances understanding of network performance
Abstract
This paper introduces an explainable DNN-based beamformer with a postfilter (ExNet-BF+PF) for multichannel signal processing. Our approach combines a U-Net with a beamformer structure for speech enhancement. The method involves a two-stage processing pipeline. In the first stage, time-invariant weights are applied to construct a multichannel spatial filter, namely a beamformer. In the second stage, a time-varying single-channel postfilter is applied at the beamformer output. Additionally, we incorporate an attention mechanism, motivated by its successful application in noisy and reverberant environments, to further improve speech enhancement. Furthermore, our study fills a gap in the existing literature by conducting a thorough spatial analysis of the network's performance. Specifically, we examine how the network utilizes spatial information during processing. This analysis…
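The two-stage pipeline described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: in the paper the weights and the postfilter are produced by the network, whereas here they are stand-ins chosen for illustration. The function names, the array shapes, the uniform averaging weights, and the real-valued gain in [0, 1] are all assumptions.

```python
import numpy as np


def apply_beamformer(Y, w):
    """Stage 1: time-invariant spatial filter.

    Y: complex STFT of the microphone signals, shape (mics, freqs, frames).
    w: complex weights, shape (mics, freqs); the same weights are used
       for every frame, which is what makes the filter time-invariant.
    Returns the single-channel output, shape (freqs, frames):
        Z[f, t] = sum_m conj(w[m, f]) * Y[m, f, t]
    """
    return np.einsum("mf,mft->ft", np.conj(w), Y)


def apply_postfilter(Z, G):
    """Stage 2: time-varying single-channel postfilter.

    Z: beamformer output, shape (freqs, frames).
    G: per-bin gain that changes from frame to frame, same shape as Z.
       Assumed here to be a real mask in [0, 1] (hypothetical choice).
    """
    return G * Z


# Toy data: random values stand in for a real multichannel recording.
rng = np.random.default_rng(0)
M, F, T = 4, 257, 100  # microphones, frequency bins, time frames
Y = rng.standard_normal((M, F, T)) + 1j * rng.standard_normal((M, F, T))

# Placeholder weights: plain averaging across microphones.
# In the paper these would be estimated by the DNN.
w = np.full((M, F), 1.0 / M, dtype=complex)

# Placeholder postfilter gain, also estimated by a network in the paper.
G = rng.uniform(0.0, 1.0, size=(F, T))

Z = apply_beamformer(Y, w)      # shape (F, T)
S_hat = apply_postfilter(Z, G)  # shape (F, T)
```

The first stage depends on frequency but not on time, so its weights can be inspected as a fixed spatial response per frequency bin, while the second stage changes per time-frequency bin and acts only on the single-channel beamformer output.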
Taxonomy
Topics: Speech Recognition and Synthesis
Methods: Softmax · Attention Is All You Need · Concatenated Skip Connection · Convolution · Max Pooling · U-Net
