Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022
Anthony Miyaguchi, Jiangyue Yu, Bryan Cheungvivatpant, Dakota Dudley,, Aniketh Swain

TL;DR
This paper presents an unsupervised learning approach using spectrogram motifs and triplet loss to classify bird sounds for the BirdCLEF 2022 challenge, achieving competitive results.
Contribution
It introduces an unsupervised representation learning method based on spectrogram motifs and triplet loss for bird sound classification.
Findings
Achieved a score of 0.48 on the public leaderboard
Developed an unsupervised spectrogram motif embedding
Demonstrated effectiveness of triplet loss in bird sound classification
Abstract
We build a classification model for the BirdCLEF 2022 challenge using unsupervised methods. We implement an unsupervised representation of the training dataset using a triplet loss on spectrogram representation of audio motifs. Our best model performs with a score of 0.48 on the public leaderboard.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Music and Audio Processing · Species Distribution and Climate Change
MethodsTriplet Loss
