Episodic fine-tuning prototypical networks for optimization-based few-shot learning: Application to audio classification

Xuanyu Zhuang (LTCI, IP Paris, S2A, IDS); Geoffroy Peeters (LTCI, IP Paris, S2A, IDS); Gaël Richard (S2A, IDS, LTCI, IP Paris)

arXiv:2410.05302·eess.AS·October 10, 2024

TL;DR

This paper introduces an episodic fine-tuning approach for Prototypical Networks (ProtoNet) and combines it with optimization-based algorithms such as MAML and Meta-Curvature to improve few-shot audio classification.

Contribution

It proposes a simple yet effective fine-tuning method for ProtoNet and integrates it with optimization-based FSL algorithms to improve adaptation in few-shot learning.

Findings

01. Significant performance improvements over standard ProtoNet on ESC-50 and Speech Commands v2 datasets.

02. The combined models outperform regular ProtoNet in few-shot audio classification.

03. The method is general and adaptable to other domains beyond audio.

Abstract

The Prototypical Network (ProtoNet) has emerged as a popular choice in Few-shot Learning (FSL) scenarios due to its remarkable performance and straightforward implementation. Building upon such success, we first propose a simple (yet novel) method to fine-tune a ProtoNet on the (labeled) support set of a C-way-K-shot test episode (without using the query set, which is only used for evaluation). We then propose an algorithmic framework that combines ProtoNet with optimization-based FSL algorithms (MAML and Meta-Curvature) to work with such a fine-tuning method. Since optimization-based algorithms endow the target learner model with the ability to adapt quickly to only a few samples, we utilize ProtoNet as the target model to enhance its fine-tuning performance with the help of a specifically designed episodic fine-tuning strategy. The experimental results confirm that…
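
To make the episode vocabulary in the abstract concrete, the following is a minimal sketch of how a standard ProtoNet classifies one query in a C-way-K-shot episode: each class prototype is the mean of that class's support embeddings, and the query is assigned by a softmax over negative squared Euclidean distances. It does not implement the paper's fine-tuning strategy or the MAML / Meta-Curvature combination. The function names, class labels, and the hard-coded 2-D "embeddings" are all hypothetical; in practice the vectors would come from a trained encoder.

```python
import math

def prototypes(support):
    """Mean embedding per class.

    support: dict mapping a class label to a list of K embedding
    vectors (plain lists of floats). Hypothetical toy format.
    """
    protos = {}
    for label, vectors in support.items():
        dim = len(vectors[0])
        protos[label] = [
            sum(v[i] for v in vectors) / len(vectors) for i in range(dim)
        ]
    return protos

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def class_probabilities(query, protos):
    """Softmax over negative squared distances to each prototype."""
    logits = {label: -sq_dist(query, p) for label, p in protos.items()}
    top = max(logits.values())  # subtract the max for numerical stability
    exps = {label: math.exp(l - top) for label, l in logits.items()}
    total = sum(exps.values())
    return {label: e / total for label, e in exps.items()}

def predict(query, protos):
    """Label of the most probable class for one query embedding."""
    probs = class_probabilities(query, protos)
    return max(probs, key=probs.get)

# Toy 2-way-2-shot episode with made-up 2-D embeddings (assumption:
# a real system would embed audio clips with a trained encoder).
support = {
    "dog_bark": [[0.0, 0.0], [0.0, 2.0]],
    "rain": [[4.0, 4.0], [6.0, 4.0]],
}
protos = prototypes(support)
query = [1.0, 1.0]
probs = class_probabilities(query, protos)
label = predict(query, protos)
```

In this framing, the fine-tuning described in the abstract would update the encoder using only the labeled support examples of the test episode, with the query set held out for evaluation.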

Code & Models

Repositories

zdsy/proto-MAML (PyTorch, official)

Taxonomy

Topics: Music and Audio Processing · Speech and Audio Processing · Machine Learning and ELM

Methods: Sparse Evolutionary Training