AVES: Animal Vocalization Encoder based on Self-Supervision

Masato Hagiwara

arXiv:2210.14493·cs.SD·October 27, 2022·1 cites

AVES: Animal Vocalization Encoder based on Self-Supervision

Masato Hagiwara

PDF

Open Access 3 Repos

TL;DR

AVES is a self-supervised transformer model for encoding animal vocalizations, leveraging unannotated data to outperform supervised models in bioacoustics classification and detection tasks.

Contribution

The paper introduces AVES, a novel self-supervised transformer-based model that effectively encodes animal vocalizations using unannotated data, improving bioacoustic task performance.

Findings

01

AVES outperforms supervised baseline models.

02

Curating small task-specific datasets enhances model quality.

03

Open-source availability of AVES models.

Abstract

The lack of annotated training data in bioacoustics hinders the use of large-scale neural network models trained in a supervised way. In order to leverage a large amount of unannotated audio data, we propose AVES (Animal Vocalization Encoder based on Self-Supervision), a self-supervised, transformer-based audio representation model for encoding animal vocalizations. We pretrain AVES on a diverse set of unannotated audio datasets and fine-tune them for downstream bioacoustics tasks. Comprehensive experiments with a suite of classification and detection tasks have shown that AVES outperforms all the strong baselines and even the supervised "topline" models trained on annotated audio classification datasets. The results also suggest that curating a small training subset related to downstream tasks is an efficient way to train high-quality audio representation models. We open-source our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnimal Vocal Communication and Behavior · Marine animal studies overview · Bat Biology and Ecology Studies