Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks
Yuren Sun, Tatiana Midori Maeda, Claudia Solis-Lemus, Daniel, Pimentel-Alarcon, Zuzana Burivalova

TL;DR
This study demonstrates that transfer learning and data augmentation enable CNN-based classification of animal sounds in tropical rainforests with limited training data, facilitating scalable biodiversity monitoring.
Contribution
It shows that CNNs with transfer learning and data augmentation can accurately classify animal call types even with small datasets, aiding conservation efforts.
Findings
Data augmentation significantly improves accuracy at small sample sizes.
Transfer learning reduces the need for large training datasets.
Open-source model can be retrained with basic programming skills.
Abstract
To protect tropical forest biodiversity, we need to be able to detect it reliably, cheaply, and at scale. Automated species detection from passively recorded soundscapes via machine-learning approaches is a promising technique towards this goal, but it is constrained by the necessity of large training data sets. Using soundscapes from a tropical forest in Borneo and a Convolutional Neural Network model (CNN) created with transfer learning, we investigate i) the minimum viable training data set size for accurate prediction of call types ('sonotypes'), and ii) the extent to which data augmentation can overcome the issue of small training data sets. We found that even relatively high sample sizes (> 80 per call type) lead to mediocre accuracy, which however improves significantly with data augmentation, including at extremely small sample sizes, regardless of taxonomic group or call…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Marine animal studies overview · Music and Audio Processing
