Segment Relevance Estimation for Audio Analysis and Weakly-Labelled   Classification

Juliano Henrique Foleiss; Tiago Fernandes Tavares

arXiv:1911.04666·eess.AS·November 13, 2019·1 cites

Segment Relevance Estimation for Audio Analysis and Weakly-Labelled Classification

Juliano Henrique Foleiss, Tiago Fernandes Tavares

PDF

Open Access

TL;DR

This paper introduces a relevance estimation method for audio segments in weakly-labelled classification, enabling adaptive, user-defined audio analysis without extra training, and proposes RELNET, a neural network leveraging this relevance for improved classification.

Contribution

The paper presents a novel relevance measure for audio segments that adapts to user-defined criteria without additional training, and introduces RELNET, a neural network architecture utilizing this measure for weakly-labelled audio classification.

Findings

01

RELNET achieved competitive results on DCASE2018 dataset.

02

The relevance measure effectively highlights important audio segments.

03

The method adapts to different user-defined viewpoints without retraining.

Abstract

We propose a method that quantifies the importance, namely relevance, of audio segments for classification in weakly-labelled problems. It works by drawing information from a set of class-wise one-vs-all classifiers. By selecting the classifiers used in each specific classification problem, the relevance measure adapts to different user-defined viewpoints without requiring additional neural network training. This characteristic allows the relevance measure to highlight audio segments that quickly adapt to user-defined criteria. Such functionality can be used for computer-assisted audio analysis. Also, we propose a neural network architecture, namely RELNET, that leverages the relevance measure for weakly-labelled audio classification problems. RELNET was evaluated in the DCASE2018 dataset and achieved competitive classification results when compared to previous attention-based proposals.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies