Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On   Federated Learning using Multiview Pseudo-Labeling

Tiantian Feng; Shrikanth Narayanan

arXiv:2203.08810·eess.AS·April 18, 2023

Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling

Tiantian Feng, Shrikanth Narayanan

PDF

Open Access 1 Repo

TL;DR

Semi-FedSER introduces a semi-supervised federated learning framework for speech emotion recognition that effectively leverages unlabeled data to improve performance while preserving privacy, demonstrated on benchmark datasets.

Contribution

This work presents the first semi-supervised federated learning approach for SER, utilizing multiview pseudo-labeling to enhance model accuracy with limited labeled data.

Findings

01

Achieves good SER performance with only 20% labeled data

02

Demonstrates effectiveness on IEMOCAP and MSP-Improv datasets

03

Preserves privacy by avoiding raw data sharing

Abstract

Speech Emotion Recognition (SER) application is frequently associated with privacy concerns as it often acquires and transmits speech data at the client-side to remote cloud platforms for further processing. These speech data can reveal not only speech content and affective information but the speaker's identity, demographic traits, and health status. Federated learning (FL) is a distributed machine learning algorithm that coordinates clients to train a model collaboratively without sharing local data. This algorithm shows enormous potential for SER applications as sharing raw speech or speech features from a user's device is vulnerable to privacy attacks. However, a major challenge in FL is limited availability of high-quality labeled data samples. In this work, we propose a semi-supervised federated learning framework, Semi-FedSER, that utilizes both labeled and unlabeled data samples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

usc-sail/fed-ser-semi
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis