Overcoming label noise in audio event detection using sequential   labeling

Jae-Bin Kim; Seongkyu Mun; Myungwoo Oh; Soyeon Choe; Yong-Hyeok Lee,; Hyung-Min Park

arXiv:2007.05191·cs.SD·July 13, 2020

Overcoming label noise in audio event detection using sequential labeling

Jae-Bin Kim, Seongkyu Mun, Myungwoo Oh, Soyeon Choe, Yong-Hyeok Lee,, Hyung-Min Park

PDF

Open Access

TL;DR

This paper proposes a method to improve audio event detection by converting strong labels into sequential labels, reducing label noise, and enhancing semi-supervised learning performance.

Contribution

It introduces a novel approach to refine strong labels into sequential labels and integrates them into AED training, improving detection accuracy and robustness.

Findings

01

Sequential labels improve segment-based F-score.

02

Including sequential prediction mitigates label noise.

03

Method enhances semi-supervised AED performance.

Abstract

This paper addresses the noisy label issue in audio event detection (AED) by refining strong labels as sequential labels with inaccurate timestamps removed. In AED, strong labels contain the occurrence of a specific event and its timestamps corresponding to the start and end of the event in an audio clip. The timestamps depend on subjectivity of each annotator, and their label noise is inevitable. Contrary to the strong labels, weak labels indicate only the occurrence of a specific event. They do not have the label noise caused by the timestamps, but the time information is excluded. To fully exploit information from available strong and weak labels, we propose an AED scheme to train with sequential labels in addition to the given strong and weak labels after converting the strong labels into the sequential labels. Using sequential labels consistently improved the performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies