Federated Learning with Dynamic Transformer for Text to Speech

Zhenhou Hong; Jianzong Wang; Xiaoyang Qu; Jie Liu; Chendong Zhao; Jing; Xiao

arXiv:2107.08795·cs.LG·July 20, 2021·1 cites

Federated Learning with Dynamic Transformer for Text to Speech

Zhenhou Hong, Jianzong Wang, Xiaoyang Qu, Jie Liu, Chendong Zhao, Jing, Xiao

PDF

Open Access

TL;DR

This paper introduces a federated dynamic transformer for text-to-speech that improves training stability, reduces communication costs, and approaches centralized model performance, addressing privacy and efficiency challenges in federated learning.

Contribution

It proposes a novel federated dynamic transformer model that enhances convergence speed, stability, and communication efficiency for federated TTS systems.

Findings

01

Achieves near-centralized performance with more clients.

02

Faster and more stable training convergence.

03

Significantly reduces communication overhead.

Abstract

Text to speech (TTS) is a crucial task for user interaction, but TTS model training relies on a sizable set of high-quality original datasets. Due to privacy and security issues, the original datasets are usually unavailable directly. Recently, federated learning proposes a popular distributed machine learning paradigm with an enhanced privacy protection mechanism. It offers a practical and secure framework for data owners to collaborate with others, thus obtaining a better global model trained on the larger dataset. However, due to the high complexity of transformer models, the convergence process becomes slow and unstable in the federated learning setting. Besides, the transformer model trained in federated learning is costly communication and limited computational speed on clients, impeding its popularity. To deal with these challenges, we propose the federated dynamic transformer.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Mobile Crowdsensing and Crowdsourcing