Deep Active Audio Feature Learning in Resource-Constrained Environments
Md Mohaimenuzzaman, Christoph Bergmeir, Bernd Meyer

TL;DR
This paper introduces a novel active learning framework that integrates feature extraction and uses raw audio for bioacoustic classification, significantly reducing labeling effort in resource-constrained environments.
Contribution
It presents a new active learning approach that refines feature extraction iteratively and processes raw audio, improving label efficiency in bioacoustic deep learning models.
Findings
Reduces labeling effort by up to 66.7% on benchmark datasets.
Effective for both large DNN models and microcontroller-based systems.
Demonstrates practical benefits in conservation biology applications.
Abstract
The scarcity of labelled data makes training Deep Neural Network (DNN) models in bioacoustic applications challenging. In typical bioacoustics applications, manually labelling the required amount of data can be prohibitively expensive. To effectively identify both new and current classes, DNN models must continue to learn new features from a modest amount of fresh data. Active Learning (AL) is an approach that can help with this learning while requiring little labelling effort. Nevertheless, the use of fixed feature extraction approaches limits feature quality, resulting in underutilization of the benefits of AL. We describe an AL framework that addresses this issue by incorporating feature extraction into the AL loop and refining the feature extractor after each round of manual annotation. In addition, we use raw audio processing rather than spectrograms, which is a novel approach.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Animal Vocal Communication and Behavior · Speech and Audio Processing
