AudioCIL: A Python Toolbox for Audio Class-Incremental Learning with   Multiple Scenes

Qisheng Xu; Yulin Sun; Yi Su; Qian Zhu; Xiaoyi Tan; Hongyu Wen; Zijian; Gao; Kele Xu; Yong Dou; Dawei Feng

arXiv:2412.11907·cs.SD·December 19, 2024

AudioCIL: A Python Toolbox for Audio Class-Incremental Learning with Multiple Scenes

Qisheng Xu, Yulin Sun, Yi Su, Qian Zhu, Xiaoyi Tan, Hongyu Wen, Zijian, Gao, Kele Xu, Yong Dou, Dawei Feng

PDF

Open Access 1 Repo

TL;DR

AudioCIL is a Python toolbox designed to facilitate audio class-incremental learning, enabling models to adapt to new audio classes over time without forgetting previous knowledge, thus better reflecting real-world audio environment dynamics.

Contribution

The paper introduces AudioCIL, a new Python toolbox that supports audio class-incremental learning, bridging the gap between static datasets and real-world streaming audio scenarios.

Findings

01

Provides a flexible framework for incremental learning in audio tasks

02

Enables models to learn new classes without forgetting old ones

03

Supports research in real-world audio environment adaptation

Abstract

Deep learning, with its robust aotomatic feature extraction capabilities, has demonstrated significant success in audio signal processing. Typically, these methods rely on static, pre-collected large-scale datasets for training, performing well on a fixed number of classes. However, the real world is characterized by constant change, with new audio classes emerging from streaming or temporary availability due to privacy. This dynamic nature of audio environments necessitates models that can incrementally learn new knowledge for new classes without discarding existing information. Introducing incremental learning to the field of audio signal processing, i.e., Audio Class-Incremental Learning (AuCIL), is a meaningful endeavor. We propose such a toolbox named AudioCIL to align audio signal processing algorithms with real-world scenarios and strengthen research in audio class-incremental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

colaudiolab/audiocil
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis