Adaptive Distributed Kernel Ridge Regression: A Feasible Distributed Learning Scheme for Data Silos
Di Wang, Xiaotong Liu, Shao-Bo Lin, Ding-Xuan Zhou

TL;DR
This paper introduces AdaDKRR, an adaptive distributed kernel ridge regression scheme that addresses data silos by balancing autonomy, privacy, and collaboration, with proven theoretical guarantees and superior empirical performance.
Contribution
It develops a novel adaptive distributed kernel ridge regression method that maintains data privacy, supports autonomous parameter selection, and proves its effectiveness both theoretically and empirically.
Findings
AdaDKRR performs similarly to optimal algorithms on full data.
It outperforms existing distributed learning schemes in experiments.
The scheme is effective in real-world applications like decision-making and forecasting.
Abstract
Data silos, mainly caused by privacy and interoperability, significantly constrain collaborations among different organizations with similar data for the same purpose. Distributed learning based on divide-and-conquer provides a promising way to settle the data silos, but it suffers from several challenges, including autonomy, privacy guarantees, and the necessity of collaborations. This paper focuses on developing an adaptive distributed kernel ridge regression (AdaDKRR) by taking autonomy in parameter selection, privacy in communicating non-sensitive information, and the necessity of collaborations in performance improvement into account. We provide both solid theoretical verification and comprehensive experiments for AdaDKRR to demonstrate its feasibility and effectiveness. Theoretically, we prove that under some mild conditions, AdaDKRR performs similarly to running the optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Data Stream Mining Techniques
