Identifying Heterogeneity in Distributed Learning
Zelin Xiao, Jia Gu, Song Xi Chen

TL;DR
This paper develops and compares methods for detecting heterogeneity in distributed data settings, focusing on minimal communication and robustness across different heterogeneity sparsity levels.
Contribution
It introduces a Wald-based test and an extreme contrast test for heterogeneity detection, with a combined approach for improved robustness and efficiency.
Findings
Wald test is consistent when the number of data blocks is small relative to sample size.
Extreme contrast test is effective for sparse heterogeneity with many data blocks.
Combined testing approach enhances robustness across heterogeneity levels.
Abstract
We study methods for identifying heterogeneous parameter components in distributed M-estimation with minimal data transmission. One is based on a re-normalized Wald test, which is shown to be consistent as long as the number of distributed data blocks is of a smaller order of the minimum block sample size and the level of heterogeneity is dense. The second one is an extreme contrast test (ECT) based on the difference between the largest and smallest component-wise estimated parameters among data blocks. By introducing a sample splitting procedure, the ECT can avoid the bias accumulation arising from the M-estimation procedures, and exhibits consistency for being much larger than the sample size while the heterogeneity is sparse. The ECT procedure is easy to operate and communication-efficient. A combination of the Wald and the extreme contrast tests is formulated to attain more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Privacy-Preserving Technologies in Data · Distributed Sensor Networks and Detection Algorithms
