UCIL: An Unsupervised Class Incremental Learning Approach for Sound   Event Detection

Yang Xiao; Rohan Kumar Das

arXiv:2407.03657·eess.AS·January 14, 2025

UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

Yang Xiao, Rohan Kumar Das

PDF

Open Access

TL;DR

This paper introduces an unsupervised class-incremental learning method for sound event detection, enabling models to adapt to new sound classes over time while maintaining performance on previously learned classes.

Contribution

It presents a novel unsupervised framework with distillation, sample selection, and exemplar update strategies tailored for incremental sound event detection.

Findings

01

Effective in integrating new sound classes without performance loss

02

Provides insights into continual learning methods for real-world audio

03

Evaluates on DCASE 2023 dataset demonstrating practical applicability

Abstract

This work explores class-incremental learning (CIL) for sound event detection (SED), advancing adaptability towards real-world scenarios. CIL's success in domains like computer vision inspired our SED-tailored method, addressing the unique challenges of diverse and complex audio environments. Our approach employs an independent unsupervised learning framework with a distillation loss function to integrate new sound classes while preserving the SED model consistency across incremental tasks. We further enhance this framework with a sample selection strategy for unlabeled data and a balanced exemplar update mechanism, ensuring varied and illustrative sound representations. Evaluating various continual learning methods on the DCASE 2023 Task 4 dataset, we find that our research offers insights into each method's applicability for real-world SED systems that can have newly added sound…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Diverse Musicological Studies