Training neural audio classifiers with few data

Jordi Pons; Joan Serr\`a; Xavier Serra

arXiv:1810.10274·cs.SD·November 7, 2018

Training neural audio classifiers with few data

Jordi Pons, Joan Serr\`a, Xavier Serra

PDF

2 Repos

TL;DR

This paper explores various supervised learning strategies, including regularization, prototypical networks, and transfer learning, to improve neural audio classifiers trained on small datasets, demonstrating transfer learning's effectiveness and prototypical networks' promise.

Contribution

It systematically evaluates the effectiveness of regularization, prototypical networks, and transfer learning for small dataset audio classification tasks.

Findings

01

Transfer learning significantly improves performance on small datasets.

02

Prototypical networks perform well without external data.

03

Regularization alone offers limited benefits.

Abstract

We investigate supervised learning strategies that improve the training of neural network audio classifiers on small annotated collections. In particular, we study whether (i) a naive regularization of the solution space, (ii) prototypical networks, (iii) transfer learning, or (iv) their combination, can foster deep learning models to better leverage a small amount of training examples. To this end, we evaluate (i-iv) for the tasks of acoustic event recognition and acoustic scene classification, considering from 1 to 100 labeled examples per class. Results indicate that transfer learning is a powerful strategy in such scenarios, but prototypical networks show promising results when one does not count with external or validation data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.