Fast threshold optimization for multi-label audio tagging using   Surrogate gradient learning

Thomas Pellegrini (IRIT-SAMoVA); Timoth\'ee Masquelier (CERCO)

arXiv:2103.00833·cs.AI·March 2, 2021

Fast threshold optimization for multi-label audio tagging using Surrogate gradient learning

Thomas Pellegrini (IRIT-SAMoVA), Timoth\'ee Masquelier (CERCO)

PDF

1 Repo

TL;DR

This paper introduces SGL-Thresh, a gradient-based method for automatically optimizing decision thresholds in multi-label audio tagging, significantly improving F1 scores on multiple datasets using pre-trained neural networks.

Contribution

The paper presents a novel surrogate gradient learning approach for threshold optimization, enabling fast and scalable F1 maximization in multi-label audio classification.

Findings

01

SGL-Thresh outperforms baseline and heuristic methods in F1 score.

02

Achieved 54.9% F1 on AudioSet, surpassing 50.7% with default thresholds.

03

Method is fast, scalable, and applicable to large tag sets.

Abstract

Multi-label audio tagging consists of assigning sets of tags to audio recordings. At inference time, thresholds are applied on the confidence scores outputted by a probabilistic classifier, in order to decide which classes are detected active. In this work, we consider having at disposal a trained classifier and we seek to automatically optimize the decision thresholds according to a performance metric of interest, in our case F-measure (micro-F1). We propose a new method, called SGL-Thresh for Surrogate Gradient Learning of Thresholds, that makes use of gradient descent. Since F1 is not differentiable, we propose to approximate the thresholding operation gradients with the gradients of a sigmoid function. We report experiments on three datasets, using state-of-the-art pre-trained deep neural networks. In all cases, SGL-Thresh outperformed three other approaches: a default threshold…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

topel/SGLThresh
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.