Statistical Consistency of Finite-dimensional Unregularized Linear   Classification

Matus Telgarsky

arXiv:1206.3072·cs.LG·June 15, 2012

Statistical Consistency of Finite-dimensional Unregularized Linear Classification

Matus Telgarsky

PDF

Open Access

TL;DR

This paper analyzes the statistical consistency of finite-dimensional unregularized linear classifiers, including logistic regression and boosting, showing they can achieve optimal risk with increasing sample size.

Contribution

It provides a finite-dimensional analysis of unregularized linear classifiers' consistency, encompassing various loss functions and boosting, with results on optimal risk convergence.

Findings

01

Linear classifiers are statistically consistent in finite dimensions.

02

Scaling complexity with sample size leads to near-optimal risk.

03

The analysis applies to logistic regression and boosting with various losses.

Abstract

This manuscript studies statistical properties of linear classifiers obtained through minimization of an unregularized convex risk over a finite sample. Although the results are explicitly finite-dimensional, inputs may be passed through feature maps; in this way, in addition to treating the consistency of logistic regression, this analysis also handles boosting over a finite weak learning class with, for instance, the exponential, logistic, and hinge losses. In this finite-dimensional setting, it is still possible to fit arbitrary decision boundaries: scaling the complexity of the weak learning class with the sample size leads to the optimal classification risk almost surely.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition · Advanced Statistical Methods and Models