Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022

Anthony Miyaguchi; Jiangyue Yu; Bryan Cheungvivatpant; Dakota Dudley,; Aniketh Swain

arXiv:2206.04805·cs.SD·July 10, 2024

Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022

Anthony Miyaguchi, Jiangyue Yu, Bryan Cheungvivatpant, Dakota Dudley,, Aniketh Swain

PDF

Open Access 1 Repo

TL;DR

This paper presents an unsupervised learning approach using spectrogram motifs and triplet loss to classify bird sounds for the BirdCLEF 2022 challenge, achieving competitive results.

Contribution

It introduces an unsupervised representation learning method based on spectrogram motifs and triplet loss for bird sound classification.

Findings

01

Achieved a score of 0.48 on the public leaderboard

02

Developed an unsupervised spectrogram motif embedding

03

Demonstrated effectiveness of triplet loss in bird sound classification

Abstract

We build a classification model for the BirdCLEF 2022 challenge using unsupervised methods. We implement an unsupervised representation of the training dataset using a triplet loss on spectrogram representation of audio motifs. Our best model performs with a score of 0.48 on the public leaderboard.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

acmiyaguchi/birdclef-2022
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnimal Vocal Communication and Behavior · Music and Audio Processing · Species Distribution and Climate Change

MethodsTriplet Loss