Efficient Sound Field Reconstruction with Conditional Invertible Neural Networks
Xenofon Karakonstantis, Efren Fernandez-Grande, Peter Gerstoft

TL;DR
This paper presents a novel approach using conditional invertible neural networks to efficiently reconstruct sound fields and room impulse responses, incorporating uncertainty estimates and reducing data requirements.
Contribution
The study introduces a CINN-based method for sound field reconstruction that balances accuracy and efficiency while handling uncertainties and sparse data.
Findings
Achieves accurate sound field reconstruction with less data.
Provides uncertainty estimates for sound field predictions.
Outperforms traditional Bayesian methods in efficiency.
Abstract
In this study, we introduce a method for estimating sound fields in reverberant environments using a conditional invertible neural network (CINN). Sound field reconstruction can be hindered by experimental errors, limited spatial data, model mismatches, and long inference times, leading to potentially flawed and prolonged characterizations. Further, the complexity of managing inherent uncertainties often escalates computational demands or is neglected in models. Our approach seeks to balance accuracy and computational efficiency, while incorporating uncertainty estimates to tailor reconstructions to specific needs. By training a CINN with Monte Carlo simulations of random wave fields, our method reduces the dependency on extensive datasets and enables inference from sparse experimental data. The CINN proves versatile at reconstructing Room Impulse Responses (RIRs), by acting either as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Image and Signal Denoising Methods
