AudioCIL: A Python Toolbox for Audio Class-Incremental Learning with Multiple Scenes
Qisheng Xu, Yulin Sun, Yi Su, Qian Zhu, Xiaoyi Tan, Hongyu Wen, Zijian, Gao, Kele Xu, Yong Dou, Dawei Feng

TL;DR
AudioCIL is a Python toolbox designed to facilitate audio class-incremental learning, enabling models to adapt to new audio classes over time without forgetting previous knowledge, thus better reflecting real-world audio environment dynamics.
Contribution
The paper introduces AudioCIL, a new Python toolbox that supports audio class-incremental learning, bridging the gap between static datasets and real-world streaming audio scenarios.
Findings
Provides a flexible framework for incremental learning in audio tasks
Enables models to learn new classes without forgetting old ones
Supports research in real-world audio environment adaptation
Abstract
Deep learning, with its robust aotomatic feature extraction capabilities, has demonstrated significant success in audio signal processing. Typically, these methods rely on static, pre-collected large-scale datasets for training, performing well on a fixed number of classes. However, the real world is characterized by constant change, with new audio classes emerging from streaming or temporary availability due to privacy. This dynamic nature of audio environments necessitates models that can incrementally learn new knowledge for new classes without discarding existing information. Introducing incremental learning to the field of audio signal processing, i.e., Audio Class-Incremental Learning (AuCIL), is a meaningful endeavor. We propose such a toolbox named AudioCIL to align audio signal processing algorithms with real-world scenarios and strengthen research in audio class-incremental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis
