WONDER: Weighted one-shot distributed ridge regression in high dimensions
Edgar Dobriban, Yue Sheng

TL;DR
This paper introduces WONDER, a novel weighted one-shot distributed ridge regression method that effectively combines local estimators, achieving significant computational savings while maintaining accuracy in high-dimensional settings.
Contribution
The paper proposes the WONDER algorithm with optimal weighting for distributed ridge regression, revealing new phenomena like the infinite-worker limit and bias-adjusted weights.
Findings
WONDER achieves at least 100x speedup in simulations and real data.
Optimal weights for combining estimators exceed unity, countering simple averaging.
Distributed estimator performs well even with very large numbers of machines.
Abstract
In many areas, practitioners need to analyze large datasets that challenge conventional single-machine computing. To scale up data analysis, distributed and parallel computing approaches are increasingly needed. Here we study a fundamental and highly important problem in this area: How to do ridge regression in a distributed computing environment? Ridge regression is an extremely popular method for supervised learning, and has several optimality properties, thus it is important to study. We study one-shot methods that construct weighted combinations of ridge regression estimators computed on each machine. By analyzing the mean squared error in a high dimensional random-effects model where each predictor has a small effect, we discover several new phenomena. 1. Infinite-worker limit: The distributed estimator works well for very large numbers of machines, a phenomenon we call…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
