Realistic sources, receivers and walls improve the generalisability of   virtually-supervised blind acoustic parameter estimators

Prerak Srivastava; Antoine Deleforge; Emmanuel Vincent

arXiv:2207.09133·cs.SD·July 20, 2022·1 cites

Realistic sources, receivers and walls improve the generalisability of virtually-supervised blind acoustic parameter estimators

Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent

PDF

Open Access

TL;DR

This study demonstrates that increasing the realism of simulated training data, including source, receiver, and wall responses, significantly enhances the generalization of blind acoustic parameter estimators to real-world environments.

Contribution

The paper shows that training solely on highly realistic simulated data enables neural networks to accurately estimate acoustic parameters in real environments, reducing reliance on real annotated data.

Findings

01

Realistic simulation improves estimation accuracy on real data.

02

Layered realism in training data enhances generalizability.

03

Simulated data can replace real measurements for training.

Abstract

Blind acoustic parameter estimation consists in inferring the acoustic properties of an environment from recordings of unknown sound sources. Recent works in this area have utilized deep neural networks trained either partially or exclusively on simulated data, due to the limited availability of real annotated measurements. In this paper, we study whether a model purely trained using a fast image-source room impulse response simulator can generalize to real data. We present an ablation study on carefully crafted simulated training sets that account for different levels of realism in source, receiver and wall responses. The extent of realism is controlled by the sampling of wall absorption coefficients and by applying measured directivity patterns to microphones and sources. A state-of-the-art model trained on these datasets is evaluated on the task of jointly estimating the room's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Underwater Acoustics Research