Deep learning based spatial aliasing reduction in beamforming for audio capture
Mateusz Guzik, Giulio Cengarle, Daniel Arteaga

TL;DR
This paper introduces a deep learning method using a U-Net architecture to reduce spatial aliasing in beamforming, significantly improving audio capture quality in multi-microphone systems.
Contribution
It presents a novel deep learning approach for spatial aliasing mitigation in beamforming, employing a U-Net to predict de-aliasing filters for enhanced spatial audio capture.
Findings
Significant objective and perceptual improvements over conventional beamforming.
Effective reduction of spatial aliasing in stereo and Ambisonics scenarios.
Demonstrates deep learning's potential in multi-microphone spatial audio enhancement.
Abstract
Spatial aliasing affects spaced microphone arrays, causing directional ambiguity above certain frequencies, degrading spatial and spectral accuracy of beamformers. Given the limitations of conventional signal processing and the scarcity of deep learning approaches to spatial aliasing mitigation, we propose a novel approach using a U-Net architecture to predict a signal-dependent de-aliasing filter, which reduces aliasing in conventional beamforming for spatial capture. Two types of multichannel filters are considered, one which treats the channels independently and a second one that models cross-channel dependencies. The proposed approach is evaluated in two common spatial capture scenarios: stereo and first-order Ambisonics. The results indicate a very significant improvement, both objective and perceptual, with respect to conventional beamforming. This work shows the potential of deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Advanced Data Compression Techniques
MethodsConcatenated Skip Connection · Max Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · U-Net
