Improving Generalization in Heterogeneous Federated Continual Learning via Spatio-Temporal Gradient Matching with Prototypical Coreset
Minh-Duong Nguyen, Le-Tuan Nguyen, Quoc-Viet Pham

TL;DR
This paper introduces STAMP, a novel federated continual learning method that uses spatio-temporal gradient matching and prototypical coresets to improve generalization and mitigate catastrophic forgetting in heterogeneous, streaming data scenarios.
Contribution
The paper proposes a model-agnostic prototype selection method and a spatio-temporal gradient matching approach to enhance federated continual learning under data heterogeneity.
Findings
STAMP outperforms existing baselines in experiments.
Prototypes effectively approximate task-wise gradients.
Gradient matching reduces catastrophic forgetting.
Abstract
Federated Continual Learning (FCL) has recently emerged as a crucial research area, as data from distributed clients typically arrives as a stream, requiring sequential learning. This paper explores a more practical and challenging FCL setting, where clients may have unrelated or even conflicting data and tasks. In this scenario, statistical heterogeneity and data noise can create spurious correlations, leading to biased feature learning and catastrophic forgetting. Existing FCL approaches often use generative replay to create pseudo-datasets of previous tasks. However, generative replay itself suffers from catastrophic forgetting and task divergence among clients, leading to overfitting in FCL. Existing FCL approaches often use generative replay to create pseudo-datasets of previous tasks. However, generative replay itself suffers from catastrophic forgetting and task divergence among…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning · Gait Recognition and Analysis
