Federated Self-Supervised Learning for Acoustic Event Classification
Meng Feng, Chieh-Chi Kao, Qingming Tang, Ming Sun, Viktor Rozgic,, Spyros Matsoukas, Chao Wang

TL;DR
This paper explores federated self-supervised learning for acoustic event classification, enabling privacy-preserving on-device representation learning that improves classifier performance without sharing raw data.
Contribution
It introduces a novel federated self-supervised learning approach for AEC that does not rely on pseudo labels or user-generated targets, enhancing privacy and performance.
Findings
Up to 20.3% relative precision improvement
Maintains recall while improving precision
Demonstrates feasibility of privacy-preserving on-device learning
Abstract
Standard acoustic event classification (AEC) solutions require large-scale collection of data from client devices for model optimization. Federated learning (FL) is a compelling framework that decouples data collection and model training to enhance customer privacy. In this work, we investigate the feasibility of applying FL to improve AEC performance while no customer data can be directly uploaded to the server. We assume no pseudo labels can be inferred from on-device user inputs, aligning with the typical use cases of AEC. We adapt self-supervised learning to the FL framework for on-device continual learning of representations, and it results in improved performance of the downstream AEC classifiers without labeled/pseudo-labeled data available. Compared to the baseline w/o FL, the proposed method improves precision up to 20.3\% relatively while maintaining the recall. Our work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
