Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning
Ilyass Moummad, Romain Serizel, Nicolas Farrugia

TL;DR
This paper demonstrates that supervised contrastive learning with data augmentation can effectively pretrain feature extractors for bioacoustic few-shot sound event detection, achieving competitive results in the DCASE challenge.
Contribution
It introduces a supervised contrastive learning framework for pretraining bioacoustic representations that transfer well to few-shot detection tasks with limited labeled data.
Findings
Achieved 63.46% F-score on validation set
Ranked second in the DCASE challenge
Effective data augmentation strategies identified
Abstract
Deep learning has been widely used recently for sound event detection and classification. Its success is linked to the availability of sufficiently large datasets, possibly with corresponding annotations when supervised learning is considered. In bioacoustic applications, most tasks come with few labelled training data, because annotating long recordings is time consuming and costly. Therefore supervised learning is not the best suited approach to solve bioacoustic tasks. The bioacoustic community recasted the problem of sound event detection within the framework of few-shot learning, i.e. training a system with only few labeled examples. The few-shot bioacoustic sound event detection task in the DCASE challenge focuses on detecting events in long audio recordings given only five annotated examples for each class of interest. In this paper, we show that learning a rich feature extractor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Animal Vocal Communication and Behavior · Speech and Audio Processing
MethodsContrastive Learning
