Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation Strategy for Southern Resident Killer Whale Detection
Bruno Padovese, Fabio Frazao, Michael Dowd, Ruth Joy

TL;DR
This paper explores the use of deep generative models like VAEs, GANs, and diffusion models for data augmentation to improve marine mammal call detection, achieving significant performance gains over traditional methods.
Contribution
It demonstrates that deep generative models can enhance data augmentation strategies, leading to better detection accuracy of Southern Resident Killer Whales in complex acoustic environments.
Findings
Diffusion-based augmentation achieved the highest recall and F1-score.
Hybrid augmentation combining generative models and traditional methods yielded the best results.
All generative approaches improved classification performance over baseline.
Abstract
Automated detection and classification of marine mammals vocalizations is critical for conservation and management efforts but is hindered by limited annotated datasets and the acoustic complexity of real-world marine environments. Data augmentation has proven to be an effective strategy to address this limitation by increasing dataset diversity and improving model generalization without requiring additional field data. However, most augmentation techniques used to date rely on effective but relatively simple transformations, leaving open the question of whether deep generative models can provide additional benefits. In this study, we evaluate the potential of deep generative for data augmentation in marine mammal call detection including: Variational Autoencoders, Generative Adversarial Networks, and Denoising Diffusion Probabilistic Models. Using Southern Resident Killer Whale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarine animal studies overview · Animal Vocal Communication and Behavior · Underwater Acoustics Research
