Semi-Supervised Audio Classification with Partially Labeled Data

Siddharth Gururani; Alexander Lerch

arXiv:2111.12761·cs.SD·November 29, 2021

Semi-Supervised Audio Classification with Partially Labeled Data

Siddharth Gururani, Alexander Lerch

PDF

Open Access

TL;DR

This paper introduces two semi-supervised learning methods for audio classification that effectively utilize partially labeled datasets, addressing the challenge of missing labels and improving model performance without requiring full annotations.

Contribution

It proposes a label enhancement approach with a teacher-student process and applies the mean teacher algorithm to handle partially labeled audio data, advancing semi-supervised techniques.

Findings

01

Proper handling of missing labels significantly improves classification accuracy.

02

Using semi-supervised methods with partially labeled data outperforms traditional supervised approaches.

03

Different strategies leveraging limited labels offer varying benefits depending on data conditions.

Abstract

Audio classification has seen great progress with the increasing availability of large-scale datasets. These large datasets, however, are often only partially labeled as collecting full annotations is a tedious and expensive process. This paper presents two semi-supervised methods capable of learning with missing labels and evaluates them on two publicly available, partially labeled datasets. The first method relies on label enhancement by a two-stage teacher-student learning process, while the second method utilizes the mean teacher semi-supervised learning algorithm. Our results demonstrate the impact of improperly handling missing labels and compare the benefits of using different strategies leveraging data with few labels. Methods capable of learning with partially labeled data have the potential to improve models for audio classification by utilizing even larger amounts of data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Water Systems and Optimization