A strongly annotated passive acoustic dataset for tropical bird monitoring
Daniela Ruiz, Juan Sebasti\'an Ulloa, Zhongqi Miao, Nicol\'as Betancourt, Maria Paula Toro-G\'omez, Andr\'es Hern\'andez, Bruno Demuro, Eliana Barona-Cort\'es, Angela Mendoza-Henao, Andr\'es Sierra-Ricaurte, Sebasti\'an P\'erez-Pe\~na, Rahul Dodhia, Pablo Arbel\'aez

TL;DR
This paper introduces PteroSet, a comprehensive, strongly annotated tropical bird sound dataset designed to advance supervised machine learning for biodiversity monitoring in complex ecosystems.
Contribution
The creation and release of PteroSet, a large, richly annotated tropical bird vocalization dataset with a COCO-inspired schema, serving as a benchmark for machine learning research.
Findings
PteroSet contains 563 recordings and 15,372 annotations across 168 species.
The dataset reveals key characteristics of tropical soundscapes, such as acoustic co-occurrence.
A baseline deep learning model for bird detection demonstrates the dataset's practical utility.
Abstract
Passive acoustic monitoring enables continuous, non-invasive biodiversity assessment across diverse ecosystems. The scale of these datasets has driven the adoption of machine learning, with supervised approaches showing strong performance. However, supervised methods require time-resolved annotated datasets, which remain scarce, especially in complex tropical soundscapes. We present PteroSet, a curated dataset of strongly annotated Neotropical bird vocalizations recorded in Puerto Asis (Putumayo) and Pivijay (Magdalena), Colombia, between 2023 and 2025. The dataset comprises 563 recordings (73.62 h) and 15,372 time-frequency annotations, including 6,702 events identified to the species level across 168 species. We release the annotations in a COCO-inspired JSON schema that unifies audio files, taxonomic categories, and labels for machine learning workflows. Beyond providing annotated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
