Couple Learning for semi-supervised sound event detection

Rui Tao; Long Yan; Kazushige Ouchi; Xiangdong Wang

arXiv:2110.05809·cs.LG·February 25, 2022

Couple Learning for semi-supervised sound event detection

Rui Tao, Long Yan, Kazushige Ouchi, Xiangdong Wang

PDF

Open Access 2 Repos

TL;DR

This paper introduces a Couple Learning approach that enhances semi-supervised sound event detection by combining a well-trained model with a Mean Teacher model, improving pseudo-label quality and overall performance.

Contribution

It proposes a novel Couple Learning method that integrates a well-trained model with Mean Teacher, boosting semi-supervised sound event detection accuracy.

Findings

01

Achieved 44.25% F1-score on DCASE2020 Task 4

02

Outperformed baseline with 32.39% F1-score

03

Validated effectiveness through Variable Order Input experiment

Abstract

The recently proposed Mean Teacher method, which exploits large-scale unlabeled data in a self-ensembling manner, has achieved state-of-the-art results in several semi-supervised learning benchmarks. Spurred by current achievements, this paper proposes an effective Couple Learning method that combines a well-trained model and a Mean Teacher model. The suggested pseudo-labels generated model (PLG) increases strongly- and weakly-labeled data to improve the Mean Teacher method-s performance. Moreover, the Mean Teacher-s consistency cost reduces the noise impact in the pseudo-labels introduced by detection errors. The experimental results on Task 4 of the DCASE2020 challenge demonstrate the superiority of the proposed method, achieving about 44.25% F1-score on the public evaluation set, significantly outperforming the baseline system-s 32.39%. At the same time, we also propose a simple and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Water Systems and Optimization