Distributed Adaptive Huber Regression

Jiyu Luo; Qiang Sun; Wenxin Zhou

arXiv:2107.02726·stat.ME·July 7, 2021·Comput. Stat. Data Anal.

Distributed Adaptive Huber Regression

Jiyu Luo, Qiang Sun, Wenxin Zhou

PDF

Open Access

TL;DR

This paper proposes a communication-efficient distributed algorithm for robust linear regression that handles heavy-tailed and asymmetric errors, achieving near-centralized accuracy and reliable confidence intervals.

Contribution

It introduces a novel distributed Huber regression algorithm that is robust to heavy-tailed errors and achieves optimal error bounds with minimal communication.

Findings

01

Achieves centralized nonasymptotic error bounds.

02

Provides Berry-Esseen bounds for confidence intervals.

03

Outperforms existing distributed methods in accuracy and coverage.

Abstract

Distributed data naturally arise in scenarios involving multiple sources of observations, each stored at a different location. Directly pooling all the data together is often prohibited due to limited bandwidth and storage, or due to privacy protocols. This paper introduces a new robust distributed algorithm for fitting linear regressions when data are subject to heavy-tailed and/or asymmetric errors with finite second moments. The algorithm only communicates gradient information at each iteration and therefore is communication-efficient. Statistically, the resulting estimator achieves the centralized nonasymptotic error bound as if all the data were pooled together and came from a distribution with sub-Gaussian tails. Under a finite $(2 + δ)$ -th moment condition, we derive a Berry-Esseen bound for the distributed estimator, based on which we construct robust confidence intervals.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Neuroimaging Techniques and Applications · Sparse and Compressive Sensing Techniques