Segment-level Metric Learning for Few-shot Bioacoustic Event Detection
Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D., Plumbley

TL;DR
This paper introduces a segment-level few-shot learning framework for bioacoustic event detection that leverages both positive and negative examples and uses transductive inference, significantly improving performance over baselines.
Contribution
The study proposes a novel segment-level metric learning approach utilizing negative events and transductive inference to enhance few-shot bioacoustic event detection.
Findings
Achieved an F-measure of 62.73 on DCASE 2022 validation set.
Outperformed the baseline prototypical network by a large margin.
Ranked 2nd in the DCASE2022-T5 challenge.
Abstract
Few-shot bioacoustic event detection is a task that detects the occurrence time of a novel sound given a few examples. Previous methods employ metric learning to build a latent space with the labeled part of different sound classes, also known as positive events. In this study, we propose a segment-level few-shot learning framework that utilizes both the positive and negative events during model optimization. Training with negative events, which are larger in volume than positive events, can increase the generalization ability of the model. In addition, we use transductive inference on the validation set during training for better adaptation to novel classes. We conduct ablation studies on our proposed method with different setups on input features, training data, and hyper-parameters. Our final system achieves an F-measure of 62.73 on the DCASE 2022 challenge task 5 (DCASE2022-T5)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Music and Audio Processing · Diverse Musicological Studies
MethodsTransductive Inference
