Federated Few-Shot Learning for Mobile NLP

Dongqi Cai; Shangguang Wang; Yaozong Wu; Felix Xiaozhu Lin; Mengwei Xu

arXiv:2212.05974·cs.LG·August 22, 2023

Federated Few-Shot Learning for Mobile NLP

Dongqi Cai, Shangguang Wang, Yaozong Wu, Felix Xiaozhu Lin, Mengwei Xu

PDF

1 Repo

TL;DR

This paper introduces FedFSL, a federated few-shot learning approach for mobile NLP that achieves high accuracy with minimal labeled data by combining pseudo labeling and prompt learning, while significantly reducing training costs.

Contribution

It pioneers federated NLP in few-shot scenarios, integrating pseudo labeling and prompt learning into a scalable system with novel cost-reduction strategies.

Findings

01

Achieves competitive accuracy with fewer than 100 labeled samples.

02

Reduces training delay, energy, and network traffic significantly.

03

Demonstrates effective federated learning with predominantly unlabeled data.

Abstract

Natural language processing (NLP) sees rich mobile applications. To support various language understanding tasks, a foundation NLP model is often fine-tuned in a federated, privacy-preserving setting (FL). This process currently relies on at least hundreds of thousands of labeled training samples from mobile clients; yet mobile users often lack willingness or knowledge to label their data. Such an inadequacy of data labels is known as a few-shot scenario; it becomes the key blocker for mobile NLP applications. For the first time, this work investigates federated NLP in the few-shot scenario (FedFSL). By retrofitting algorithmic advances of pseudo labeling and prompt learning, we first establish a training pipeline that delivers competitive accuracy when only 0.05% (fewer than 100) of the training data is labeled and the remaining is unlabeled. To instantiate the workflow, we further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ubiquitouslearning/fes
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.