Confidence Intervals for Evaluation of Data Mining

Zheng Yuan; Wenxin Jiang

arXiv:2502.07016·stat.ML·July 8, 2025

Confidence Intervals for Evaluation of Data Mining

Zheng Yuan, Wenxin Jiang

PDF

Open Access

TL;DR

This paper develops fast, asymptotic confidence intervals for various data mining performance measures, enabling statistically sound comparisons of classification rules without extensive resampling.

Contribution

It introduces a novel 'blurring correction' for variance, extending the plus-four method to general performance measures in data mining.

Findings

01

Confidence intervals achieve good finite sample coverage.

02

The method allows simultaneous inference on multiple measures.

03

It avoids computationally intensive bootstrap resampling.

Abstract

In data mining, when binary prediction rules are used to predict a binary outcome, many performance measures are used in a vast array of literature for the purposes of evaluation and comparison. Some examples include classification accuracy, precision, recall, F measures, and Jaccard index. Typically, these performance measures are only approximately estimated from a finite dataset, which may lead to findings that are not statistically significant. In order to properly quantify such statistical uncertainty, it is important to provide confidence intervals associated with these estimated performance measures. We consider statistical inference about general performance measures used in data mining, with both individual and joint confidence intervals. These confidence intervals are based on asymptotic normal approximations and can be computed fast, without needs to do bootstrap resampling.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications