High Dimensional Classification through $\ell_0$-Penalized Empirical Risk Minimization
Le-Yu Chen, Sokbae Lee

TL;DR
This paper introduces a high-dimensional binary classification method that minimizes empirical risk with an 0 penalty, achieving near-true sparsity and providing theoretical guarantees on performance.
Contribution
It develops a novel 0-penalized classification approach with non-asymptotic bounds and practical implementation via mixed integer programming.
Findings
Achieves high probability of near-true sparsity
Provides convergence rates for excess misclassification risk
Demonstrates effectiveness through Monte Carlo experiments
Abstract
We consider a high dimensional binary classification problem and construct a classification procedure by minimizing the empirical misclassification risk with a penalty on the number of selected features. We derive non-asymptotic probability bounds on the estimated sparsity as well as on the excess misclassification risk. In particular, we show that our method yields a sparse solution whose l0-norm can be arbitrarily close to true sparsity with high probability and obtain the rates of convergence for the excess misclassification risk. The proposed procedure is implemented via the method of mixed integer linear programming. Its numerical performance is illustrated in Monte Carlo experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Machine Learning and Algorithms
