High-Dimensional Distributed Sparse Classification with Scalable Communication-Efficient Global Updates

Fred Lu; Ryan R. Curtin; Edward Raff; Francis Ferraro; James Holt

arXiv:2407.06346 · cs.LG · July 10, 2024

TL;DR

This paper introduces a scalable, communication-efficient distributed method for high-dimensional sparse logistic regression, effectively handling massive datasets with millions of features and improving accuracy with minimal communication rounds.

Contribution

The paper develops solutions to two problems in distributed surrogate likelihood optimization for logistic regression on large-scale data: diverging updates and inefficient handling of sparsity.

Findings

1. Significant accuracy improvement over existing distributed algorithms
2. Fewer communication rounds needed for convergence
3. Comparable or faster runtimes on large datasets

Abstract

As the size of datasets used in statistical learning continues to grow, distributed training of models has attracted increasing attention. These methods partition the data and exploit parallelism to reduce memory and runtime, but suffer increasingly from communication costs as the data size or the number of iterations grows. Recent work on linear models has shown that a surrogate likelihood can be optimized locally to iteratively improve on an initial solution in a communication-efficient manner. However, existing versions of these methods exhibit multiple shortcomings as the data size becomes massive, including diverging updates and inefficient handling of sparsity. In this work we develop solutions to these problems which enable us to learn a communication-efficient distributed logistic regression model even beyond millions of features. In our experiments we demonstrate a large…
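
To make the surrogate likelihood idea in the abstract concrete, the following is a minimal sketch of one generic surrogate update for L1-regularized logistic regression. It is not the paper's algorithm or the proxcsl implementation. It assumes the commonly used surrogate form, in which one worker minimizes its local loss after shifting its gradient so that it matches the global gradient at the current iterate, and it assumes equal-size partitions, labels in {0, 1}, and a plain proximal gradient inner solver. All function names, the step size, and the iteration count are hypothetical choices for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_loss(theta, X, y):
    # Mean logistic loss for labels y in {0, 1}.
    z = X @ theta
    return np.mean(np.logaddexp(0.0, z) - y * z)

def logistic_grad(theta, X, y):
    # Gradient of the mean logistic loss.
    return X.T @ (sigmoid(X @ theta) - y) / X.shape[0]

def soft_threshold(v, t):
    # Proximal operator of t * ||.||_1.
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def surrogate_update(theta_t, X_local, y_local, global_grad, lam,
                     step=0.5, iters=200):
    # Assumed surrogate objective (hypothetical illustration):
    #   L_local(theta) - <grad L_local(theta_t) - global_grad, theta>
    #   + lam * ||theta||_1
    # minimized here with proximal gradient descent.
    shift = logistic_grad(theta_t, X_local, y_local) - global_grad
    theta = theta_t.copy()
    for _ in range(iters):
        g = logistic_grad(theta, X_local, y_local) - shift
        theta = soft_threshold(theta - step * g, step * lam)
    return theta

def distributed_round(theta_t, partitions, lam):
    # One communication round: every worker reports its local gradient at
    # theta_t, the average is the global gradient (equal-size partitions
    # assumed), and the first worker solves the surrogate problem.
    grads = [logistic_grad(theta_t, X, y) for X, y in partitions]
    global_grad = np.mean(grads, axis=0)
    X0, y0 = partitions[0]
    return surrogate_update(theta_t, X0, y0, global_grad, lam)
```

Each call to `distributed_round` exchanges only one gradient vector per worker, which is where the communication savings come from. The abstract notes that updates of this kind can diverge when the data become massive, so this sketch should be read as the baseline idea rather than a robust solver.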

Code & Models

Repositories

futurecomputing4ai/proxcsl (official)

Taxonomy

Methods: Logistic Regression