TL;DR
This paper introduces a novel collaborative framework for federated continual learning that leverages small local models to enhance large foundation models, addressing challenges of data privacy, resource constraints, and continual adaptation.
Contribution
It proposes the first framework combining small and large models in federated continual learning, with novel techniques for continual fine-tuning and knowledge distillation.
Findings
Superior performance with heterogeneous small models
Effective prevention of small model forgetting
Enhanced large model utility through collaboration
Abstract
Continual learning (CL) for Foundation Models (FMs) is an essential yet underexplored challenge, especially in Federated Continual Learning (FCL), where each client learns from a private, evolving task stream under strict data and communication constraints. Despite their powerful generalization abilities, FMs often exhibit suboptimal performance on local downstream tasks, as they are unable to utilize private local data. Furthermore, enabling FMs to learn new tasks without forgetting prior knowledge is inherently a challenging problem, primarily due to their immense parameter count and high model complexity. In contrast, small models can be trained locally under resource-constrained conditions and benefit from more mature CL techniques. To bridge the gap between small models and FMs, we propose the first collaborative framework in FCL, where lightweight local models act as a dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
