A hybrid parametric-deep learning approach for sound event localization   and detection

Andres Perez-Lopez; Eduardo Fonseca; Xavier Serra

arXiv:1908.10133·cs.SD·August 28, 2019

A hybrid parametric-deep learning approach for sound event localization and detection

Andres Perez-Lopez, Eduardo Fonseca, Xavier Serra

PDF

1 Repo

TL;DR

This paper presents a hybrid approach combining parametric spatial audio analysis with deep learning for sound event localization and detection, achieving significantly improved localization accuracy in a competitive challenge.

Contribution

It introduces a novel hybrid method that integrates parametric and deep learning techniques, reducing localization error by 2.6 times compared to the baseline.

Findings

01

Localization error reduced by a factor of 2.6

02

Performance comparable to baseline in detection

03

Effective integration of parametric and deep learning methods

Abstract

This work describes and discusses an algorithm submitted to the Sound Event Localization and Detection Task of DCASE2019 Challenge. The proposed methodology relies on parametric spatial audio analysis for source localization and detection, combined with a deep learning-based monophonic event classifier. The evaluation of the proposed algorithm yields overall results comparable to the baseline system. The main highlight is a reduction of the localization error on the evaluation dataset by a factor of 2.6, compared with the baseline performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andresperezlopez/DCASE2019_task3
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.