Enabling Early Audio Event Detection with Neural Networks

Huy Phan; Philipp Koch; Ian McLoughlin; Alfred Mertins

arXiv:1712.02116·cs.SD·April 9, 2019

Enabling Early Audio Event Detection with Neural Networks

Huy Phan, Philipp Koch, Ian McLoughlin, Alfred Mertins

PDF

TL;DR

This paper introduces a novel neural network-based system for early audio event detection, utilizing dual loss functions and a new inference step to recognize ongoing events during their initial stages, achieving state-of-the-art results.

Contribution

The paper proposes a new early detection methodology combining tailored-loss DNNs and a novel inference step, enhancing real-time audio event recognition capabilities.

Findings

01

Achieves state-of-the-art detection performance on ITC-Irst database.

02

Partial events enable early detection comparable to full events.

03

Monotonicity of the detection function is theoretically validated.

Abstract

This paper presents a methodology for early detection of audio events from audio streams. Early detection is the ability to infer an ongoing event during its initial stage. The proposed system consists of a novel inference step coupled with dual parallel tailored-loss deep neural networks (DNNs). The DNNs share a similar architecture except for their loss functions, i.e. weighted loss and multitask loss, which are designed to efficiently cope with issues common to audio event detection. The inference step is newly introduced to make use of the network outputs for recognizing ongoing events. The monotonicity of the detection function is required for reliable early detection, and will also be proved. Experiments on the ITC-Irst database show that the proposed system achieves state-of-the-art detection performance. Furthermore, even partial events are sufficient to achieve good performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.