Dynamic Gradient Aggregation for Federated Domain Adaptation

Dimitrios Dimitriadis; Kenichi Kumatani; Robert Gmyr; Yashesh Gaur and; Sefik Emre Eskimez

arXiv:2106.07578·cs.LG·June 15, 2021

Dynamic Gradient Aggregation for Federated Domain Adaptation

Dimitrios Dimitriadis, Kenichi Kumatani, Robert Gmyr, Yashesh Gaur and, Sefik Emre Eskimez

PDF

Open Access

TL;DR

This paper introduces a novel federated learning algorithm with weighted gradient aggregation that significantly accelerates convergence and improves speech recognition performance in supervised and unsupervised settings.

Contribution

A new federated learning scheme using weighted gradient aggregation with two-step optimization, enhancing convergence speed and model accuracy in speech recognition tasks.

Findings

01

7x faster convergence on LibriSpeech

02

6% WER reduction in supervised SR

03

20% WER improvement in session adaptation

Abstract

In this paper, a new learning algorithm for Federated Learning (FL) is introduced. The proposed scheme is based on a weighted gradient aggregation using two-step optimization to offer a flexible training pipeline. Herein, two different flavors of the aggregation method are presented, leading to an order of magnitude improvement in convergence speed compared to other distributed or FL training algorithms like BMUF and FedAvg. Further, the aggregation algorithm acts as a regularizer of the gradient quality. We investigate the effect of our FL algorithm in supervised and unsupervised Speech Recognition (SR) scenarios. The experimental validation is performed based on three tasks: first, the LibriSpeech task showing a speed-up of 7x and 6% word error rate reduction (WERR) compared to the baseline results. The second task is based on session adaptation providing 20% WERR over a powerful LAS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning · Music and Audio Processing