Learning Large Scale Sparse Models

Atul Dhingra; Jie Shen; Nicholas Kleene

arXiv:2301.10958·stat.ML·January 31, 2023

Learning Large Scale Sparse Models

Atul Dhingra, Jie Shen, Nicholas Kleene

PDF

Open Access

TL;DR

This paper introduces online methods for learning large-scale sparse models like Lasso, addressing computational and memory challenges by updating models with single samples and analyzing the effects on sparsity.

Contribution

It proposes novel online algorithms for sparse model learning that are memory-efficient and analyzes the impact on sparsity compared to batch methods.

Findings

01

Memory cost is independent of sample size.

02

Sparsity from batch methods is not preserved online.

03

Mini-batch and thresholding variants improve performance.

Abstract

In this work, we consider learning sparse models in large scale settings, where the number of samples and the feature dimension can grow as large as millions or billions. Two immediate issues occur under such challenging scenario: (i) computational cost; (ii) memory overhead. In particular, the memory issue precludes a large volume of prior algorithms that are based on batch optimization technique. To remedy the problem, we propose to learn sparse models such as Lasso in an online manner where in each iteration, only one randomly chosen sample is revealed to update a sparse iterate. Thereby, the memory cost is independent of the sample size and gradient evaluation for one sample is efficient. Perhaps amazingly, we find that with the same parameter, sparsity promoted by batch methods is not preserved in online fashion. We analyze such interesting phenomenon and illustrate some effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Statistical Methods and Inference · Machine Learning and Algorithms