Domain-Incremental Learning for Audio Classification
Manjunath Mulimani, Annamaria Mesaros

TL;DR
This paper introduces a dynamic network architecture for domain-incremental audio classification that balances knowledge retention and adaptation across diverse acoustic datasets, improving accuracy in single-label and multi-label tasks.
Contribution
The work presents a novel dynamic network architecture that effectively manages domain-incremental learning in audio classification, addressing the challenge of catastrophic forgetting while adapting to new acoustic domains.
Findings
Achieves 71.9% accuracy in single-label classification across European and Korean datasets.
Attains 47.5% lwlrap in multi-label classification between Audioset and FSD50K.
Balances knowledge retention and adaptation effectively across multiple acoustic domains.
Abstract
In this work, we propose a method for domain-incremental learning for audio classification from a sequence of datasets recorded in different acoustic conditions. Fine-tuning a model on a sequence of evolving domains or datasets leads to forgetting of previously learned knowledge. On the other hand, freezing all the layers of the model leads to the model not adapting to the new domain. In this work, our novel dynamic network architecture keeps the shared homogeneous acoustic characteristics of domains, and learns the domain-specific acoustic characteristics in incremental steps. Our approach achieves a good balance between retaining the knowledge of previously learned domains and acquiring the knowledge of the new domain. We demonstrate the effectiveness of the proposed method on incremental learning of single-label classification of acoustic scenes from European cities and Korea, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Speech Recognition and Synthesis
