Calculating a confidence interval on the sum of binned leakage
Ian Ruchlin, Richard W. Schnee

TL;DR
This paper presents an efficient algorithm for calculating confidence intervals on the total number of misclassified events across multiple bins, improving accuracy and computational efficiency in rare-event search analyses.
Contribution
It introduces a novel method using a Lagrange multiplier to compute confidence intervals on summed binomial data, reducing computational complexity.
Findings
Quadratic time complexity in the number of bins
Produces slightly conservative confidence intervals
Applicable to rare-event search calibration data
Abstract
Calculating the expected number of misclassified outcomes is a standard problem of particular interest for rare-event searches. The Clopper-Pearson method allows calculation of classical confidence intervals on the amount of misclassification if data are all drawn from the same binomial probability distribution. However, data is often better described by breaking it up into several bins, each represented by a different binomial distribution. We describe and provide an algorithm for calculating a classical confidence interval on the expected total number of misclassified events from several bins, based on calibration data with the same probability of misclassification on a bin-by-bin basis. Our method avoids a computationally intensive multidimensional search by introducing a Lagrange multiplier and performing standard root finding. This method has only quadratic time complexity as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
