Population Expansion for Training Language Models with Private Federated Learning
Tatsuki Koga, Congzheng Song, Martin Pelikan, Mona Chitnis

TL;DR
This paper proposes a population expansion method using domain adaptation to enhance federated learning with differential privacy, especially for small populations, leading to faster training and better model utility.
Contribution
It introduces a novel population expansion approach for federated learning with differential privacy, improving training efficiency and model quality in small population scenarios.
Findings
Utility improved by 13% to 30% on language datasets
Faster training with smaller populations
Enhanced model performance with domain adaptation
Abstract
Federated learning (FL) combined with differential privacy (DP) offers machine learning (ML) training with distributed devices and with a formal privacy guarantee. With a large population of devices, FL with DP produces a performant model in a timely manner. However, for applications with a smaller population, not only does the model utility degrade as the DP noise is inversely proportional to population, but also the training latency increases since waiting for enough clients to become available from a smaller pool is slower. In this work, we thus propose expanding the population based on domain adaptation techniques to speed up the training and improves the final model quality when training with small populations. We empirically demonstrate that our techniques can improve the utility by 13% to 30% on real-world language modeling datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
