Deep Active Audio Feature Learning in Resource-Constrained Environments

Md Mohaimenuzzaman; Christoph Bergmeir; Bernd Meyer

arXiv:2308.13201 · cs.SD · July 2, 2024

TL;DR

This paper introduces an active learning framework for bioacoustic classification that brings feature extraction into the active learning loop and operates on raw audio rather than spectrograms, significantly reducing labeling effort in resource-constrained environments.

Contribution

It presents an active learning approach that refines the feature extractor after each round of manual annotation and processes raw audio directly, improving label efficiency in bioacoustic deep learning models.

Findings

01. Reduces labeling effort by up to 66.7% on benchmark datasets.

02. Effective for both large DNN models and microcontroller-based systems.

03. Demonstrates practical benefits in conservation biology applications.

Abstract

The scarcity of labelled data makes training Deep Neural Network (DNN) models in bioacoustic applications challenging. In typical bioacoustics applications, manually labelling the required amount of data can be prohibitively expensive. To effectively identify both new and current classes, DNN models must continue to learn new features from a modest amount of fresh data. Active Learning (AL) is an approach that can help with this learning while requiring little labelling effort. Nevertheless, the use of fixed feature extraction approaches limits feature quality, resulting in underutilization of the benefits of AL. We describe an AL framework that addresses this issue by incorporating feature extraction into the AL loop and refining the feature extractor after each round of manual annotation. In addition, we use raw audio processing rather than spectrograms, which is a novel approach.…
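
The loop described in the abstract (annotate a batch, refine the feature extractor, query again) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the paper's models are DNNs trained on raw audio, whereas this sketch substitutes a tiny NumPy network on generic feature vectors, and it assumes entropy-based uncertainty sampling as the query strategy, which the abstract does not specify. The names `TinyNet`, `active_learning_loop`, and `oracle` are hypothetical.

```python
import numpy as np


def softmax(z):
    # Numerically stable row-wise softmax.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


class TinyNet:
    """Hypothetical stand-in model: a hidden layer acting as the
    'feature extractor' followed by a linear classifier."""

    def __init__(self, n_in, n_hidden, n_classes, rng):
        self.W1 = rng.normal(0.0, 0.1, (n_in, n_hidden))       # feature extractor
        self.W2 = rng.normal(0.0, 0.1, (n_hidden, n_classes))  # classifier

    def features(self, X):
        return np.tanh(X @ self.W1)

    def predict_proba(self, X):
        return softmax(self.features(X) @ self.W2)

    def fit(self, X, y, epochs=300, lr=0.1):
        # Full-batch gradient descent on mean cross-entropy. Both W1 and W2
        # are updated, so the features are refined, not kept fixed.
        Y = np.eye(self.W2.shape[1])[y]
        for _ in range(epochs):
            H = self.features(X)
            G = (softmax(H @ self.W2) - Y) / len(X)  # gradient w.r.t. logits
            grad_W2 = H.T @ G
            grad_W1 = X.T @ ((G @ self.W2.T) * (1.0 - H ** 2))
            self.W2 -= lr * grad_W2
            self.W1 -= lr * grad_W1


def active_learning_loop(X_pool, oracle, n_classes, rounds=5, batch=10, seed=0):
    """oracle(i) returns the label of pool item i (the manual annotation step)."""
    rng = np.random.default_rng(seed)
    net = TinyNet(X_pool.shape[1], 16, n_classes, rng)
    # Seed the labelled set with a small random batch.
    start = rng.choice(len(X_pool), size=batch, replace=False)
    labels = {int(i): oracle(int(i)) for i in start}

    def refit():
        idx = np.array(sorted(labels))
        net.fit(X_pool[idx], np.array([labels[i] for i in idx]))

    for _ in range(rounds):
        refit()  # refine feature extractor and classifier on all labels so far
        unlabelled = np.array([i for i in range(len(X_pool)) if i not in labels])
        if len(unlabelled) == 0:
            break
        P = net.predict_proba(X_pool[unlabelled])
        entropy = -(P * np.log(P + 1e-12)).sum(axis=1)
        # Assumed query strategy: label the most uncertain items next.
        for i in unlabelled[np.argsort(entropy)[-batch:]]:
            labels[int(i)] = oracle(int(i))
    refit()  # final refinement with the last annotated batch
    return net, labels
```

With the default settings the loop requests `batch * (rounds + 1)` labels in total, provided the pool is at least that large. The point of the design is that `fit` updates the feature layer as well as the classifier after every annotation round, in contrast to a pipeline where features are extracted once and only the classifier is retrained.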

Code & Models

Repositories

mohaimenz/deep_active_featl (PyTorch, official)

Taxonomy

Topics: Music and Audio Processing · Animal Vocal Communication and Behavior · Speech and Audio Processing