FedAdapter: Efficient Federated Learning for Modern NLP

Dongqi Cai; Yaozong Wu; Shangguang Wang; Felix Xiaozhu Lin; Mengwei Xu

arXiv:2205.10162·cs.LG·May 10, 2023·6 cites

FedAdapter: Efficient Federated Learning for Modern NLP

Dongqi Cai, Yaozong Wu, Shangguang Wang, Felix Xiaozhu Lin, Mengwei Xu

PDF

Open Access 1 Repo

TL;DR

FedAdapter introduces an adaptive framework that significantly accelerates federated NLP training by dynamically configuring adapter modules, enabling practical deployment of large pre-trained models across diverse tasks and resource constraints.

Contribution

The paper proposes FedAdapter, a novel method that automates adapter configuration and progressively upgrades them during training to improve efficiency in federated NLP.

Findings

01

Reduces FedNLP convergence time by up to 155.5x

02

Enables training of large models on mobile devices

03

Achieves faster training without sacrificing accuracy

Abstract

Transformer-based pre-trained models have revolutionized NLP for superior performance and generality. Fine-tuning pre-trained models for downstream tasks often requires private data, for which federated learning is the de-facto approach (i.e., FedNLP). However, our measurements show that FedNLP is prohibitively slow due to the large model sizes and the resultant high network/computation cost. Towards practical FedNLP, we identify as the key building blocks adapters, small bottleneck modules inserted at a variety of model layers. A key challenge is to properly configure the depth and width of adapters, to which the training speed and efficiency is highly sensitive. No silver-bullet configuration exists: the optimal choice varies across downstream NLP tasks, desired model accuracy, and mobile resources. To automate adapter configuration, we propose FedAdapter, a framework that enhances…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UbiquitousLearning/FedAdapter
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · COVID-19 diagnosis using AI · Artificial Intelligence in Healthcare and Education

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Adapter