Federated Online Learning for Heterogeneous Multisource Streaming Data
Jingmao Li, Yuanxing Chen, Shuangge Ma, Kuangnan Fang

TL;DR
This paper introduces a federated online learning method tailored for multi-source streaming data, addressing heterogeneity and privacy concerns while ensuring efficient, consistent, and accurate model estimation and prediction.
Contribution
It proposes a novel federated online learning framework with personalized models and subgroup assumptions, improving performance on heterogeneous streaming data.
Findings
The method achieves consistent model estimation and subgroup recovery.
It demonstrates superior prediction accuracy on real-world datasets.
The approach reduces storage needs by using summary statistics.
Abstract
Federated learning has emerged as an essential paradigm for distributed multi-source data analysis under privacy concerns. Most existing federated learning methods focus on the ``static" datasets. However, in many real-world applications, data arrive continuously over time, forming streaming datasets. This introduces additional challenges for data storage and algorithm design, particularly under high-dimensional settings. In this paper, we propose a federated online learning (FOL) method for distributed multi-source streaming data analysis. To account for heterogeneity, a personalized model is constructed for each data source, and a novel ``subgroup" assumption is employed to capture potential similarities, thereby enhancing model performance. We adopt the penalized renewable estimation method and the efficient proximal gradient descent for model training. The proposed method aligns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing
