The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

Dimitris Bertsimas; Vassilis Digalakis Jr

arXiv:2006.06592·cs.LG·July 19, 2022

The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

Dimitris Bertsimas, Vassilis Digalakis Jr

PDF

TL;DR

The paper introduces the backbone method, a scalable framework for sparse, interpretable machine learning in ultra-high dimensional settings, significantly reducing computation time while maintaining accuracy.

Contribution

It proposes a two-phase backbone approach that efficiently identifies relevant features, enabling scalable sparse learning and interpretability in extremely high-dimensional data.

Findings

01

Solves sparse regression with 10^7 features in minutes

02

Handles decision trees with 10^5 features in minutes

03

Outperforms or matches state-of-the-art methods in high-dimensional tasks

Abstract

We present the backbone method, a generic framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems. We solve sparse regression problems with $1 0^{7}$ features in minutes and $1 0^{8}$ features in hours, as well as decision tree problems with $1 0^{5}$ features in minutes.The proposed method operates in two phases: we first determine the backbone set, consisting of potentially relevant features, by solving a number of tractable subproblems; then, we solve a reduced problem, considering only the backbone features. For the sparse regression problem, our theoretical analysis shows that, under certain assumptions and with high probability, the backbone set consists of the truly relevant features. Numerical experiments on both synthetic and real-world datasets demonstrate that our method outperforms or competes with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.