Buffer-based Gradient Projection for Continual Federated Learning

Shenghong Dai; Jy-yong Sohn; Yicong Chen; S M Iftekharul Alam,; Ravikumar Balakrishnan; Suman Banerjee; Nageen Himayat; Kangwook Lee

arXiv:2409.01585·cs.LG·September 5, 2024

Buffer-based Gradient Projection for Continual Federated Learning

Shenghong Dai, Jy-yong Sohn, Yicong Chen, S M Iftekharul Alam,, Ravikumar Balakrishnan, Suman Banerjee, Nageen Himayat, Kangwook Lee

PDF

Open Access 1 Repo

TL;DR

This paper introduces Fed-A-GEM, a buffer-based gradient projection method for continual federated learning that reduces catastrophic forgetting by using local and aggregated buffer gradients, improving accuracy on benchmarks.

Contribution

It proposes Fed-A-GEM, a novel federated adaptation of A-GEM that leverages buffer-based gradient projection to mitigate forgetting without requiring task boundary knowledge.

Findings

01

Achieves up to 27% accuracy improvement on CIFAR-100.

02

Consistently outperforms existing CFL methods across benchmarks.

03

Effectively mitigates catastrophic forgetting in federated settings.

Abstract

Continual Federated Learning (CFL) is essential for enabling real-world applications where multiple decentralized clients adaptively learn from continuous data streams. A significant challenge in CFL is mitigating catastrophic forgetting, where models lose previously acquired knowledge when learning new information. Existing approaches often face difficulties due to the constraints of device storage capacities and the heterogeneous nature of data distributions among clients. While some CFL algorithms have addressed these challenges, they frequently rely on unrealistic assumptions about the availability of task boundaries (i.e., knowing when new tasks begin). To address these limitations, we introduce Fed-A-GEM, a federated adaptation of the A-GEM method (Chaudhry et al., 2019), which employs a buffer-based gradient projection approach. Fed-A-GEM alleviates catastrophic forgetting by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shenghongdai/fed-a-gem
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks