Deep learning-based filtering of cross-spectral matrices using generative adversarial networks

Christof Puhle

arXiv:2502.21097·cs.SD·March 3, 2025

TL;DR

This paper introduces a deep learning approach using GANs to filter noise and distortions from microphone array data represented as cross-spectral matrices, improving sound processing accuracy.

Contribution

It presents a novel GAN-based method specifically designed for transforming cross-spectral matrices in sound data filtering tasks.

Findings

01 Effective noise reduction demonstrated in simulated environments

02 Model successfully performs multiple transformation tasks

03 Improved sound data quality over traditional filtering methods

Abstract

In this paper, we present a deep-learning method to filter out effects such as ambient noise, reflections, or source directivity from microphone array data represented as cross-spectral matrices. Specifically, we focus on a generative adversarial network (GAN) architecture designed to transform fixed-size cross-spectral matrices. These models were trained using sound pressure simulations of varying complexity developed for this purpose. Based on the results from applying these methods in a hyperparameter optimization of an auto-encoding task, we trained the optimized model to perform five distinct transformation tasks derived from different complexities inherent in our sound pressure simulations.
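
For readers unfamiliar with the data representation, the following is a minimal sketch of how a cross-spectral matrix can be estimated from multichannel microphone recordings. It is not taken from the paper: the function name `cross_spectral_matrix`, the Hann window, the non-overlapping block averaging, and the block size are all illustrative assumptions. For each frequency bin, the result is a Hermitian matrix of size (number of microphones) x (number of microphones), which is the kind of fixed-size object the abstract describes as the input and output of the network.

```python
import numpy as np


def cross_spectral_matrix(signals, block_size=256):
    """Estimate cross-spectral matrices by averaging over signal blocks.

    Hypothetical helper for illustration only (not the paper's code).

    signals: real array of shape (n_mics, n_samples).
    Returns a complex array of shape (n_freqs, n_mics, n_mics),
    where n_freqs = block_size // 2 + 1.
    """
    n_mics, n_samples = signals.shape
    n_blocks = n_samples // block_size
    if n_blocks == 0:
        raise ValueError("signal is shorter than one block")

    # Split every channel into non-overlapping blocks:
    # shape (n_mics, n_blocks, block_size). Trailing samples are dropped.
    usable = signals[:, : n_blocks * block_size]
    blocks = usable.reshape(n_mics, n_blocks, block_size)

    # Window each block (assumed Hann window) and take the one-sided FFT:
    # shape (n_mics, n_blocks, n_freqs).
    window = np.hanning(block_size)
    spectra = np.fft.rfft(blocks * window, axis=-1)

    # C[f, i, j] = mean over blocks of P_i(f) * conj(P_j(f)).
    csm = np.einsum("ibf,jbf->fij", spectra, np.conj(spectra)) / n_blocks
    return csm


# Usage: 4 microphones, 2048 samples of synthetic noise.
rng = np.random.default_rng(0)
pressure = rng.standard_normal((4, 2048))
csm = cross_spectral_matrix(pressure, block_size=256)
print(csm.shape)  # (129, 4, 4)
```

The diagonal entries hold the auto-power spectra of the individual microphones, while the off-diagonal entries carry the relative phase information between microphone pairs. No normalization or scaling convention is applied in this sketch.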
