A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection
Archontis Politis, Sharath Adavanne, Tuomas Virtanen

TL;DR
This paper introduces a complex dataset of reverberant spatial sound scenes with moving sources for sound event localization and detection, along with a baseline neural network method for benchmarking.
Contribution
It provides a new, more diverse dataset with moving sound sources for the SELD task and an improved baseline model for evaluation.
Findings
The dataset includes dynamic and static sound events in realistic acoustic environments.
The baseline model achieves benchmark scores on the new dataset.
The dataset facilitates research on sound source movement and localization.
Abstract
This report presents the dataset and the evaluation setup of the Sound Event Localization & Detection (SELD) task for the DCASE 2020 Challenge. The SELD task refers to the problem of trying to simultaneously classify a known set of sound event classes, detect their temporal activations, and estimate their spatial directions or locations while they are active. To train and test SELD systems, datasets of diverse sound events occurring under realistic acoustic conditions are needed. Compared to the previous challenge, a significantly more complex dataset was created for DCASE 2020. The two key differences are a more diverse range of acoustical conditions, and dynamic conditions, i.e. moving sources. The spatial sound scenes are created using real room impulse responses captured in a continuous manner with a slowly moving excitation source. Both static and moving sound events are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
