Sparse Hierarchical Regression with Polynomials

Dimitris Bertsimas; Bart Van Parys

arXiv:1709.10030·math.OC·September 29, 2017

Sparse Hierarchical Regression with Polynomials

Dimitris Bertsimas, Bart Van Parys

PDF

TL;DR

This paper introduces an exact hierarchical sparse polynomial regression method that efficiently identifies relevant inputs and monomials in high-dimensional data, balancing model complexity and prediction accuracy.

Contribution

It presents a novel two-step approach combining input ranking heuristics and cutting plane optimization to achieve exact sparse polynomial regression.

Findings

01

Method accurately identifies relevant features and monomials.

02

Phase transition observed in feature selection performance.

03

Scales to datasets with approximately 10,000 observations and 1,000 inputs.

Abstract

We present a novel method for exact hierarchical sparse polynomial regression. Our regressor is that degree $r$ polynomial which depends on at most $k$ inputs, counting at most $ℓ$ monomial terms, which minimizes the sum of the squares of its prediction errors. The previous hierarchical sparse specification aligns well with modern big data settings where many inputs are not relevant for prediction purposes and the functional complexity of the regressor needs to be controlled as to avoid overfitting. We present a two-step approach to this hierarchical sparse regression problem. First, we discard irrelevant inputs using an extremely fast input ranking heuristic. Secondly, we take advantage of modern cutting plane methods for integer optimization to solve our resulting reduced hierarchical $(k, ℓ)$ -sparse problem exactly. The ability of our method to identify all $k$ relevant inputs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.