MetaAudio: A Few-Shot Audio Classification Benchmark

Calum Heggan; Sam Budgett; Timothy Hospedales; Mehrdad Yaghoobi

arXiv:2204.02121·cs.SD·April 12, 2022

MetaAudio: A Few-Shot Audio Classification Benchmark

Calum Heggan, Sam Budgett, Timothy Hospedales, Mehrdad Yaghoobi

PDF

Open Access 1 Repo

TL;DR

MetaAudio introduces a comprehensive, reproducible audio-based benchmark for few-shot learning, demonstrating the effectiveness of gradient-based meta-learning methods across diverse sound datasets and settings.

Contribution

This work provides the first extensive audio-based few-shot classification benchmark and analyzes joint training and cross-dataset adaptation for improved generalization.

Findings

01

Gradient-based meta-learning methods outperform others.

02

Joint training improves generalization across datasets.

03

Cross-dataset adaptation shows promising results.

Abstract

Currently available benchmarks for few-shot learning (machine learning with few training examples) are limited in the domains they cover, primarily focusing on image classification. This work aims to alleviate this reliance on image-based benchmarks by offering the first comprehensive, public and fully reproducible audio based alternative, covering a variety of sound domains and experimental settings. We compare the few-shot classification performance of a variety of techniques on seven audio datasets (spanning environmental sounds to human-speech). Extending this, we carry out in-depth analyses of joint training (where all datasets are used during training) and cross-dataset adaptation protocols, establishing the possibility of a generalised audio few-shot classification algorithm. Our experimentation shows gradient-based meta-learning methods such as MAML and Meta-Curvature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cheggan/metaaudio-a-few-shot-audio-classification-benchmark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Diverse Musicological Studies

MethodsModel-Agnostic Meta-Learning