Experiments on the DCASE Challenge 2016: Acoustic Scene Classification   and Sound Event Detection in Real Life Recording

Benjamin Elizalde; Anurag Kumar; Ankit Shah; Rohan Badlani; Emmanuel; Vincent; Bhiksha Raj; Ian Lane

arXiv:1607.06706·cs.SD·August 26, 2016·27 cites

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording

Benjamin Elizalde, Anurag Kumar, Ankit Shah, Rohan Badlani, Emmanuel, Vincent, Bhiksha Raj, Ian Lane

PDF

Open Access

TL;DR

This paper reports on experiments for acoustic scene classification and sound event detection in real-life recordings, achieving significant improvements over baseline performance through feature and classifier optimization.

Contribution

The authors demonstrate enhanced methods for acoustic scene classification and sound event detection, surpassing baseline results in the DCASE 2016 challenge.

Findings

01

Achieved 78.9% accuracy in scene classification

02

Reduced segment-based error rate to 0.76 in sound event detection

03

Implemented feature and classifier optimizations

Abstract

In this paper we present our work on Task 1 Acoustic Scene Classi- fication and Task 3 Sound Event Detection in Real Life Recordings. Among our experiments we have low-level and high-level features, classifier optimization and other heuristics specific to each task. Our performance for both tasks improved the baseline from DCASE: for Task 1 we achieved an overall accuracy of 78.9% compared to the baseline of 72.6% and for Task 3 we achieved a Segment-Based Error Rate of 0.76 compared to the baseline of 0.91.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies