Exact and empirical estimation of misclassification probability

Victor Nedelko

arXiv:1408.3332·stat.ML·August 15, 2014

Exact and empirical estimation of misclassification probability

Victor Nedelko

PDF

Open Access

TL;DR

This paper investigates risk estimation in classification, deriving analytic bounds for bias in empirical risk and exploring their use in empirical risk estimation, with a focus on confidence intervals.

Contribution

It introduces simple analytic approximations for the maximum bias of empirical risk in histogram classifiers and studies their application in risk estimation.

Findings

01

Derived analytic bounds for empirical risk bias

02

Analyzed the use of these bounds in risk estimation

03

Provided insights into confidence interval maximization

Abstract

We discuss the problem of risk estimation in the classification problem, with specific focus on finding distributions that maximize the confidence intervals of risk estimation. We derived simple analytic approximations for the maximum bias of empirical risk for histogram classifier. We carry out a detailed study on using these analytic estimates for empirical estimation of risk.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Machine Learning and Data Classification