VAST : The Virtual Acoustic Space Traveler Dataset

Clément Gaultier (PANAMA); Saurabh Kataria (PANAMA; IIT Kanpur); Antoine Deleforge (PANAMA)

arXiv:1612.06287 · cs.SD · December 20, 2016

TL;DR

This paper presents VAST, a large dataset of simulated room impulse responses for training sound source localization models. It shows that mappings learned on this virtual data generalize to real data, overcoming some intrinsic limitations of traditional binaural localization methods based on time differences of arrival.

Contribution

Introduces the virtual acoustic space traveling (VAST) paradigm and a first dataset of simulated room impulse responses designed for it. Mappings from audio features to audio properties are learned on the simulated data and then applied to real recordings.

Findings

01  Mappings learned on VAST generalize to real data

02  The VAST dataset improves sound localization accuracy

03  Overcomes some intrinsic limitations of traditional binaural localization methods based on time differences of arrival

Abstract

This paper introduces a new paradigm for sound source localization referred to as virtual acoustic space traveling (VAST) and presents a first dataset designed for this purpose. Existing sound source localization methods are either based on an approximate physical model (physics-driven) or on a specific-purpose calibration set (data-driven). With VAST, the idea is to learn a mapping from audio features to desired audio properties using a massive dataset of simulated room impulse responses. This virtual dataset is designed to be maximally representative of the potential audio scenes that the considered system may be evolving in, while remaining reasonably compact. We show that virtually-learned mappings on this dataset generalize to real data, overcoming some intrinsic limitations of traditional binaural sound localization methods based on time differences of arrival.
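
For context, the time-difference-of-arrival baseline mentioned in the abstract can be sketched as below. This is a generic cross-correlation estimator written for illustration, not code from the paper or the dataset; the function name `estimate_tdoa`, the 16 kHz sampling rate, and the synthetic noise signal are all assumptions.

```python
import numpy as np


def estimate_tdoa(left, right, fs):
    """Estimate the delay of `right` relative to `left`, in seconds.

    Hypothetical helper: plain cross-correlation peak picking.
    A positive value means the sound reached the left channel first.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    # Full cross-correlation; output index i corresponds to lag i - (len(left) - 1).
    xcorr = np.correlate(right, left, mode="full")
    lag = int(np.argmax(xcorr)) - (len(left) - 1)
    return lag / fs


# Toy check on synthetic noise (assumed values, not VAST data).
fs = 16000  # assumed sampling rate in Hz
rng = np.random.default_rng(0)
source = rng.standard_normal(1024)
delay = 5  # right channel lags the left by 5 samples
left = source
right = np.concatenate([np.zeros(delay), source[:-delay]])
tdoa = estimate_tdoa(left, right, fs)
```

As a general property of two-microphone setups, a single delay only constrains the source to a cone of directions, so on its own it cannot separate front from back or resolve elevation. Whether this is among the specific limitations the authors address is not stated in the abstract.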

Taxonomy

Topics: Speech and Audio Processing · Hearing Loss and Rehabilitation · Music and Audio Processing