Federating Dynamic Models using Early-Exit Architectures for Automatic   Speech Recognition on Heterogeneous Clients

Mohamed Nabih Ali; Alessio Brutti; Daniele Falavigna

arXiv:2405.17376·cs.CL·May 28, 2024

Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients

Mohamed Nabih Ali, Alessio Brutti, Daniele Falavigna

PDF

Open Access 1 Repo 2 Models

TL;DR

This paper introduces a federated learning approach for speech recognition that uses early-exit dynamic models to adapt to heterogeneous client devices, improving efficiency and privacy.

Contribution

It proposes a novel federated learning method employing early-exit architectures for adaptive, resource-efficient speech recognition on diverse edge devices.

Findings

01

Effective performance on public datasets

02

Compatible with basic federated learning strategies

03

Reduces computational burden on edge devices

Abstract

Automatic speech recognition models require large amounts of speech recordings for training. However, the collection of such data often is cumbersome and leads to privacy concerns. Federated learning has been widely used as an effective decentralized technique that collaboratively learns a shared prediction model while keeping the data local on different clients. Unfortunately, client devices often feature limited computation and communication resources leading to practical difficulties for large models. In addition, the heterogeneity that characterizes edge devices makes it sub-optimal to generate a single model that fits all of them. Differently from the recent literature, where multiple models with different architectures are used, in this work, we propose using dynamical architectures which, employing early-exit solutions, can adapt their processing (i.e. traversed layers) depending…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mnabihali/ASR-FL
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis