DeepForestSound: a multi-species automatic detector for passive acoustic monitoring in African tropical forests, a case study in Kibale National Park
Gabriel Dubus, Th\'eau d'Audiffret, Claire Auger, Rapha\"el Cornette, Sylvain Haupert, Innocent Kasekendi, Raymond Katumba, Hugo Magaldi, Lise Pernel, Harold Rugonge, J\'er\^ome Sueur, John Justice Tibesigwa, Sabrina Krief

TL;DR
DeepForestSound is a semi-supervised, multi-species acoustic detection model tailored for African tropical forests, demonstrating high accuracy and generalization across taxa and time in biodiversity monitoring.
Contribution
The paper introduces a novel semi-supervised pipeline with low-rank adaptation fine-tuning for multi-species detection in tropical forests, outperforming existing tools.
Findings
DFS achieves AP of 0.964 for primates and 0.961 for elephants.
LoRA fine-tuning outperforms linear probing across taxa.
Region-specific training improves detection in complex tropical environments.
Abstract
Passive Acoustic Monitoring (PAM) is widely used for biodiversity assessment. Its application in African tropical forests is limited by scarce annotated data, reducing the performance of general-purpose ecoacoustic models on underrepresented taxa. In this study, we introduce DeepForestSound (DFS), a multi-species automatic detection model designed for PAM in African tropical forests. DFS relies on a semi-supervised pipeline combining clustering of unannotated recordings with manual validation, followed by supervised fine-tuning of an Audio Spectrogram Transformer (AST) using low-rank adaptation, which is compared to a frozen-backbone linear baseline (DFS-Linear). The framework supports the detection of multiple taxonomic groups, including birds, primates, and elephants, from long-term acoustic recordings. DFS was trained on acoustic data collected in the Sebitoli area, in Kibale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
