TL;DR
This paper introduces FedFSL, a federated few-shot learning approach for mobile NLP that achieves high accuracy with minimal labeled data by combining pseudo labeling and prompt learning, while significantly reducing training costs.
Contribution
It pioneers federated NLP in few-shot scenarios, integrating pseudo labeling and prompt learning into a scalable system with novel cost-reduction strategies.
Findings
Achieves competitive accuracy with fewer than 100 labeled samples.
Reduces training delay, energy, and network traffic significantly.
Demonstrates effective federated learning with predominantly unlabeled data.
Abstract
Natural language processing (NLP) sees rich mobile applications. To support various language understanding tasks, a foundation NLP model is often fine-tuned in a federated, privacy-preserving setting (FL). This process currently relies on at least hundreds of thousands of labeled training samples from mobile clients; yet mobile users often lack willingness or knowledge to label their data. Such an inadequacy of data labels is known as a few-shot scenario; it becomes the key blocker for mobile NLP applications. For the first time, this work investigates federated NLP in the few-shot scenario (FedFSL). By retrofitting algorithmic advances of pseudo labeling and prompt learning, we first establish a training pipeline that delivers competitive accuracy when only 0.05% (fewer than 100) of the training data is labeled and the remaining is unlabeled. To instantiate the workflow, we further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
