Distributed Convoluted Rank Regression for Non-Shareable Data under Non-Additive Losses
Wen Zhang, Liping Zhu, Songshan Yang

TL;DR
This paper introduces a novel distributed high-dimensional rank regression method for non-additive losses, ensuring statistical efficiency and model selection consistency with minimal communication.
Contribution
It develops a surrogate loss for convoluted rank regression in distributed settings, maintaining statistical properties and enabling scalable, communication-efficient estimation.
Findings
Surrogate loss shares the same population minimizer as full-data CRR.
Establishes non-asymptotic error bounds and oracle properties.
Achieves centralized oracle rates with O(log N) communication rounds.
Abstract
We study high-dimensional rank regression when data are distributed across multiple machines and the loss is a non-additive U-statistic, as in convoluted rank regression (CRR). Classical communication-efficient surrogate likelihood (CSL) methods crucially rely on the additivity of the empirical loss and therefore break down for CRR, whose global loss couples all sample pairs across machines. We propose a distributed convoluted rank regression (DCRR) framework that constructs a similar surrogate loss and demonstrate its validity under the non-additive losses. We show that this surrogate shares the same population minimizer as the full-data CRR loss and yields estimators that are statistically equivalent to centralized CRR. Building on this, we develop a two-stage sparse DCRR procedure -- an iterative -penalized stage followed by a folded-concave refinement -- and establish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Privacy-Preserving Technologies in Data
