Continual Adaptation of Vision Transformers for Federated Learning
Shaunak Halbe, James Seale Smith, Junjiao Tian, Zsolt Kira

TL;DR
This paper introduces a novel, efficient method for continual federated learning with vision transformers that reduces forgetting and communication costs without compromising privacy, suitable for large-scale image classification tasks.
Contribution
It proposes a prompting-based approach with a lightweight generation and distillation scheme for continual federated learning using vision transformers, addressing data heterogeneity and forgetting.
Findings
Outperforms existing methods by up to 7% accuracy.
Reduces communication and computation costs significantly.
Effective on CIFAR-100, ImageNet-R, and DomainNet datasets.
Abstract
In this paper, we focus on the important yet understudied problem of Continual Federated Learning (CFL), where a server communicates with a set of clients to incrementally learn new concepts over time without sharing or storing any data. The complexity of this problem is compounded by challenges from both the Continual and Federated Learning perspectives. Specifically, models trained in a CFL setup suffer from catastrophic forgetting which is exacerbated by data heterogeneity across clients. Existing attempts at this problem tend to impose large overheads on clients and communication channels or require access to stored data which renders them unsuitable for real-world use due to privacy. In this paper, we attempt to tackle forgetting and heterogeneity while minimizing overhead costs and without requiring access to any stored data. We study this problem in the context of Vision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · COVID-19 diagnosis using AI · Domain Adaptation and Few-Shot Learning
MethodsFocus
