Adaptive Multi-scale Detection of Acoustic Events

Wenhao Ding; Liang He

arXiv:1911.06878·eess.AS·November 26, 2019

Adaptive Multi-scale Detection of Acoustic Events

Wenhao Ding, Liang He

PDF

Open Access

TL;DR

This paper introduces AdaMD, an adaptive multi-scale neural network approach for acoustic event detection that leverages time-frequency analysis and multi-resolution predictions to improve accuracy and noise robustness in complex environments.

Contribution

The paper proposes a novel adaptive multi-scale detection method using hourglass and GRU modules, with an adaptive training algorithm to enhance acoustic event detection performance.

Findings

01

Outperforms state-of-the-art in ER and F1-score on DCASE datasets

02

Demonstrates noise resistance in factory environments

03

Effective multi-scale prediction improves detection accuracy

Abstract

The goal of acoustic (or sound) events detection (AED or SED) is to predict the temporal position of target events in given audio segments. This task plays a significant role in safety monitoring, acoustic early warning and other scenarios. However, the deficiency of data and diversity of acoustic event sources make the AED task a tough issue, especially for prevalent data-driven methods. In this paper, we start by analyzing acoustic events according to their time-frequency domain properties, showing that different acoustic events have different time-frequency scale characteristics. Inspired by the analysis, we propose an adaptive multi-scale detection (AdaMD) method. By taking advantage of the hourglass neural network and gated recurrent unit (GRU) module, our AdaMD produces multiple predictions at different temporal and frequency resolutions. An adaptive training algorithm is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Anomaly Detection Techniques and Applications