FedAdapter: Efficient Federated Learning for Modern NLP
Dongqi Cai, Yaozong Wu, Shangguang Wang, Felix Xiaozhu Lin, Mengwei Xu

TL;DR
FedAdapter introduces an adaptive framework that significantly accelerates federated NLP training by dynamically configuring adapter modules, enabling practical deployment of large pre-trained models across diverse tasks and resource constraints.
Contribution
The paper proposes FedAdapter, a novel method that automates adapter configuration and progressively upgrades them during training to improve efficiency in federated NLP.
Findings
Reduces FedNLP convergence time by up to 155.5x
Enables training of large models on mobile devices
Achieves faster training without sacrificing accuracy
Abstract
Transformer-based pre-trained models have revolutionized NLP for superior performance and generality. Fine-tuning pre-trained models for downstream tasks often requires private data, for which federated learning is the de-facto approach (i.e., FedNLP). However, our measurements show that FedNLP is prohibitively slow due to the large model sizes and the resultant high network/computation cost. Towards practical FedNLP, we identify as the key building blocks adapters, small bottleneck modules inserted at a variety of model layers. A key challenge is to properly configure the depth and width of adapters, to which the training speed and efficiency is highly sensitive. No silver-bullet configuration exists: the optimal choice varies across downstream NLP tasks, desired model accuracy, and mobile resources. To automate adapter configuration, we propose FedAdapter, a framework that enhances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · COVID-19 diagnosis using AI · Artificial Intelligence in Healthcare and Education
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Adapter
