High Dimensional Classification through $\ell_0$-Penalized Empirical   Risk Minimization

Le-Yu Chen; Sokbae Lee

arXiv:1811.09540·stat.ME·November 26, 2018·1 cites

High Dimensional Classification through $\ell_0$-Penalized Empirical Risk Minimization

Le-Yu Chen, Sokbae Lee

PDF

Open Access 1 Repo

TL;DR

This paper introduces a high-dimensional binary classification method that minimizes empirical risk with an 0 penalty, achieving near-true sparsity and providing theoretical guarantees on performance.

Contribution

It develops a novel 0-penalized classification approach with non-asymptotic bounds and practical implementation via mixed integer programming.

Findings

01

Achieves high probability of near-true sparsity

02

Provides convergence rates for excess misclassification risk

03

Demonstrates effectiveness through Monte Carlo experiments

Abstract

We consider a high dimensional binary classification problem and construct a classification procedure by minimizing the empirical misclassification risk with a penalty on the number of selected features. We derive non-asymptotic probability bounds on the estimated sparsity as well as on the excess misclassification risk. In particular, we show that our method yields a sparse solution whose l0-norm can be arbitrarily close to true sparsity with high probability and obtain the rates of convergence for the excess misclassification risk. The proposed procedure is implemented via the method of mixed integer linear programming. Its numerical performance is illustrated in Monte Carlo experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LeyuChen/L0-ERM
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Machine Learning and Algorithms