Federated Self-Training for Semi-Supervised Audio Recognition

Vasileios Tsouvalas; Aaqib Saeed; Tanir Ozcelebi

arXiv:2107.06877·cs.LG·February 28, 2022

Federated Self-Training for Semi-Supervised Audio Recognition

Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi

PDF

Open Access 1 Repo

TL;DR

This paper introduces FedSTAR, a federated semi-supervised learning method using self-training and self-supervised pre-training to enhance audio recognition models on decentralized unlabeled data, achieving significant accuracy improvements with minimal labeled data.

Contribution

The paper proposes FedSTAR, a novel federated semi-supervised learning approach that leverages unlabeled data and self-supervised pre-training to improve on-device audio recognition.

Findings

01

FedSTAR improves recognition rate by 13.28% with only 3% labeled data.

02

Self-supervised pre-training accelerates training convergence.

03

Performance is validated on diverse public audio datasets.

Abstract

Federated Learning is a distributed machine learning paradigm dealing with decentralized and personal datasets. Since data reside on devices like smartphones and virtual assistants, labeling is entrusted to the clients, or labels are extracted in an automated way. Specifically, in the case of audio data, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result, an abundance of audio data remains unlabeled and unexploited on users' devices. Most existing federated learning approaches focus on supervised learning without harnessing the unlabeled data. In this work, we study the problem of semi-supervised learning of audio models via self-training in conjunction with federated learning. We propose FedSTAR to exploit large-scale on-device unlabeled data to improve the generalization of audio recognition models. We further demonstrate that self-supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

FederatedSTAR/FedSTAR
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Advanced Data and IoT Technologies