Domain-Incremental Learning for Audio Classification

Manjunath Mulimani; Annamaria Mesaros

arXiv:2412.17424·eess.AS·December 24, 2024·ICASSP

Domain-Incremental Learning for Audio Classification

Manjunath Mulimani, Annamaria Mesaros

PDF

Open Access

TL;DR

This paper introduces a dynamic network architecture for domain-incremental audio classification that balances knowledge retention and adaptation across diverse acoustic datasets, improving accuracy in single-label and multi-label tasks.

Contribution

The work presents a novel dynamic network architecture that effectively manages domain-incremental learning in audio classification, addressing the challenge of catastrophic forgetting while adapting to new acoustic domains.

Findings

01

Achieves 71.9% accuracy in single-label classification across European and Korean datasets.

02

Attains 47.5% lwlrap in multi-label classification between Audioset and FSD50K.

03

Balances knowledge retention and adaptation effectively across multiple acoustic domains.

Abstract

In this work, we propose a method for domain-incremental learning for audio classification from a sequence of datasets recorded in different acoustic conditions. Fine-tuning a model on a sequence of evolving domains or datasets leads to forgetting of previously learned knowledge. On the other hand, freezing all the layers of the model leads to the model not adapting to the new domain. In this work, our novel dynamic network architecture keeps the shared homogeneous acoustic characteristics of domains, and learns the domain-specific acoustic characteristics in incremental steps. Our approach achieves a good balance between retaining the knowledge of previously learned domains and acquiring the knowledge of the new domain. We demonstrate the effectiveness of the proposed method on incremental learning of single-label classification of acoustic scenes from European cities and Korea, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Speech Recognition and Synthesis