Federated Continual Learning for Text Classification via Selective Inter-client Transfer
Yatin Chaudhary, Pranav Rai, Matthias Schubert, Hinrich Sch\"utze,, Pankaj Gupta

TL;DR
This paper introduces FedSeIT, a federated continual learning framework for text classification that selectively transfers knowledge between clients, improving performance while preserving privacy in heterogeneous task environments.
Contribution
It proposes a novel federated continual learning method with selective inter-client transfer and domain overlap assessment for NLP tasks.
Findings
Achieved an average of 12.4% performance gain over baselines.
First application of federated continual learning to NLP.
Effectively mitigated inter-client interference in heterogeneous tasks.
Abstract
In this work, we combine the two paradigms: Federated Learning (FL) and Continual Learning (CL) for text classification task in cloud-edge continuum. The objective of Federated Continual Learning (FCL) is to improve deep learning models over life time at each client by (relevant and efficient) knowledge transfer without sharing data. Here, we address challenges in minimizing inter-client interference while knowledge sharing due to heterogeneous tasks across clients in FCL setup. In doing so, we propose a novel framework, Federated Selective Inter-client Transfer (FedSeIT) which selectively combines model parameters of foreign clients. To further maximize knowledge transfer, we assess domain overlap and select informative tasks from the sequence of historical tasks at each foreign client while preserving privacy. Evaluating against the baselines, we show improved performance, a gain of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning
