Statistical Hypothesis Testing Based on Machine Learning: Large Deviations Analysis
Paolo Braca, Leonardo M. Millefiori, Augusto Aubry, Stefano Marano,, Antonio De Maio, Peter Willett

TL;DR
This paper applies large deviations theory to analyze the exponential decay rate of error probabilities in machine learning classification, linking it to the training data and the learned decision function.
Contribution
It introduces a mathematical framework using large deviations to quantify how quickly ML classification error probabilities vanish, based on the data-driven decision function and its properties.
Findings
Error probabilities decay exponentially with the number of observations.
The error rate can be numerically verified using datasets or synthetic data.
Asymptotic Gaussian convergence of the decision function enables setting false alarm probabilities.
Abstract
We study the performance -- and specifically the rate at which the error probability converges to zero -- of Machine Learning (ML) classification techniques. Leveraging the theory of large deviations, we provide the mathematical conditions for a ML classifier to exhibit error probabilities that vanish exponentially, say , where is the number of informative observations available for testing (or another relevant parameter, such as the size of the target in an image) and is the error rate. Such conditions depend on the Fenchel-Legendre transform of the cumulant-generating function of the Data-Driven Decision Function (D3F, i.e., what is thresholded before the final binary decision is made) learned in the training phase. As such, the D3F and, consequently, the related error rate , depend on the given training set, which is assumed of finite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Fault Detection and Control Systems · Anomaly Detection Techniques and Applications
