An Information-Theoretic Analysis for Federated Learning under Concept Drift
Fu Peng, Meng Zhang, Ming Tang

TL;DR
This paper analyzes how concept drift affects federated learning using information theory and proposes a regularization algorithm to improve long-term performance under various drift patterns, validated through experiments on Raspberry Pi devices.
Contribution
It models concept drift as a Markov chain, introduces the Stationary Generalization Error, and develops a KL divergence and mutual information-based regularization algorithm for federated learning.
Findings
Drift patterns significantly impact FL performance.
The proposed method outperforms existing approaches across drift types.
Experimental validation confirms theoretical insights.
Abstract
Recent studies in federated learning (FL) commonly train models on static datasets. However, real-world data often arrives as streams with shifting distributions, causing performance degradation known as concept drift. This paper analyzes FL performance under concept drift using information theory and proposes an algorithm to mitigate the performance degradation. We model concept drift as a Markov chain and introduce the \emph{Stationary Generalization Error} to assess a model's capability to capture characteristics of future unseen data. Its upper bound is derived using KL divergence and mutual information. We study three drift patterns (periodic, gradual, and random) and their impact on FL performance. Inspired by this, we propose an algorithm that regularizes the empirical risk minimization approach with KL divergence and mutual information, thereby enhancing long-term performance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Privacy-Preserving Technologies in Data · Caching and Content Delivery
