AVEX: What Matters for Animal Vocalization Encoding
Marius Miron, David Robinson, Milad Alizadeh, Ellen Gilsenan-McMahon, Gagan Narula, Emmanuel Chemla, Maddie Cusimano, Felix Effenberger, Masato Hagiwara, Benjamin Hoffman, Sara Keen, Diane Kim, Jane Lawton, Jen-Yu Liu, Aza Raskin, Olivier Pietquin, Matthieu Geist

TL;DR
This paper conducts a comprehensive empirical study on bioacoustic encoders, emphasizing data diversity, model architectures, and evaluation breadth, to improve animal vocalization representations for various tasks.
Contribution
It introduces a large-scale analysis of bioacoustic encoder training, highlighting key factors like data diversity and training strategies that enhance performance across multiple species and tasks.
Findings
Self-supervised pre-training plus supervised fine-tuning yields best results.
Data diversity in training improves out-of-distribution performance.
Model checkpoints will be publicly released for research use.
Abstract
Bioacoustics, the study of sounds produced by living organisms, plays a vital role in conservation, biodiversity monitoring, and behavioral studies. Many tasks in this field, such as species, individual, and behavior classification and detection, are well-suited to machine learning. However, they often suffer from limited annotated data, highlighting the need for a general-purpose bioacoustic encoder capable of extracting useful representations for diverse downstream tasks. Such encoders have been proposed before, but are often limited in scope due to a focus on a narrow range of species (typically birds), and a reliance on a single model architecture or training paradigm. Moreover, they are usually evaluated on a small set of tasks and datasets. In this work, we present a large-scale empirical study that covers aspects of bioacoustics that are relevant to research but have previously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗EarthSpeciesProject/esp-aves2-sl-beats-allmodel· ♡ 1♡ 1
- 🤗EarthSpeciesProject/esp-aves2-eat-allmodel
- 🤗EarthSpeciesProject/esp-aves2-eat-biomodel
- 🤗EarthSpeciesProject/esp-aves2-effnetb0-allmodel
- 🤗EarthSpeciesProject/esp-aves2-effnetb0-audiosetmodel
- 🤗EarthSpeciesProject/esp-aves2-effnetb0-biomodel
- 🤗EarthSpeciesProject/esp-aves2-naturelm-audio-v1-beatsmodel
- 🤗EarthSpeciesProject/esp-aves2-sl-beats-biomodel· ♡ 1♡ 1
- 🤗EarthSpeciesProject/esp-aves2-sl-eat-all-ssl-allmodel
- 🤗EarthSpeciesProject/esp-aves2-sl-eat-bio-ssl-allmodel
Videos
