Audio Event Detection using Weakly Labeled Data

Anurag Kumar; Bhiksha Raj

arXiv:1605.02401·cs.SD·July 7, 2016

Audio Event Detection using Weakly Labeled Data

Anurag Kumar, Bhiksha Raj

PDF

TL;DR

This paper introduces a novel framework for acoustic event detection using only weakly labeled data, formulating it as a multiple instance learning problem and employing SVMs and neural networks to detect and temporally locate events.

Contribution

It presents the first approach to acoustic event detection with weak labels by formulating it as multiple instance learning and proposing two effective frameworks.

Findings

01

Supports weakly labeled data for event detection

02

Provides temporal localization of events

03

Reduces need for manual annotation

Abstract

Acoustic event detection is essential for content analysis and description of multimedia recordings. The majority of current literature on the topic learns the detectors through fully-supervised techniques employing strongly labeled data. However, the labels available for majority of multimedia data are generally weak and do not provide sufficient detail for such methods to be employed. In this paper we propose a framework for learning acoustic event detectors using only weakly labeled data. We first show that audio event detection using weak labels can be formulated as an Multiple Instance Learning problem. We then suggest two frameworks for solving multiple-instance learning, one based on support vector machines, and the other on neural networks. The proposed methods can help in removing the time consuming and expensive process of manually annotating data to facilitate fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.