Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics
Ben Williams, Bart van Merri\"enboer, Vincent Dumoulin, Jenny Hamer,, Eleni Triantafillou, Abram B. Fleishman, Matthew McKown, Jill E. Munger,, Aaron N. Rice, Ashlee Lillis, Clemency E. White, Catherine A. D. Hobbs, Tries, B. Razak, Kate E. Jones, Tom Denton

TL;DR
This paper demonstrates that combining bird, reef, and unrelated sounds during pretraining significantly improves transfer learning for marine bioacoustics, enabling effective ecological monitoring with minimal annotation.
Contribution
It introduces a novel pretraining strategy that leverages cross-domain audio mixing to enhance generalization in reef bioacoustics classification tasks.
Findings
Pretraining on bird sounds outperforms reef-only pretraining.
Cross-domain mixing maximizes reef sound classification accuracy.
SurfPerch network enables efficient marine sound analysis with minimal data.
Abstract
Machine learning has the potential to revolutionize passive acoustic monitoring (PAM) for ecological assessments. However, high annotation and compute costs limit the field's efficacy. Generalizable pretrained networks can overcome these costs, but high-quality pretraining requires vast annotated libraries, limiting its current applicability primarily to bird taxa. Here, we identify the optimum pretraining strategy for a data-deficient domain using coral reef bioacoustics. We assemble ReefSet, a large annotated library of reef sounds, though modest compared to bird libraries at 2% of the sample count. Through testing few-shot transfer learning performance, we observe that pretraining on bird audio provides notably superior generalizability compared to pretraining on ReefSet or unrelated audio alone. However, our key findings show that cross-domain mixing which leverages bird, reef and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUnderwater Acoustics Research · Marine animal studies overview · Animal Vocal Communication and Behavior
MethodsCorrelation Alignment for Deep Domain Adaptation · Lib
