Optimal Cross-Validation for Sparse Linear Regression
Ryan Cory-Wright, Andr\'es G\'omez

TL;DR
This paper develops efficient relaxations for k-fold cross-validation in sparse linear regression, significantly reducing computational costs while maintaining competitive hyperparameter selection accuracy.
Contribution
It introduces computationally tractable relaxations of the cross-validation loss, enabling fewer mixed-integer optimization problems to be solved in hyperparameter tuning.
Findings
Fewer MIOs needed for hyperparameter selection (50-80% reduction).
Relaxations perform competitively with existing software like glmnet and L0Learn.
Effective especially in small sample-to-feature ratio scenarios.
Abstract
Given a high-dimensional covariate matrix and a response vector, ridge-regularized sparse linear regression selects a subset of features that explains the relationship between covariates and the response in an interpretable manner. To choose hyperparameters that control the sparsity level and amount of regularization, practitioners commonly use k-fold cross-validation. However, cross-validation substantially increases the computational cost of sparse regression as it requires solving many mixed-integer optimization problems (MIOs) for each hyperparameter combination. To address this computational burden, we derive computationally tractable relaxations of the k-fold cross-validation loss, facilitating hyperparameter selection while solving -- fewer MIOs in practice. Our computational results demonstrate, across eleven real-world UCI datasets, that exact MIO-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Industrial Vision Systems and Defect Detection · Face and Expression Recognition
