Generalized asymptotic formulae for estimating statistical significance in high energy physics analyses
M. J. Basso (1) ((1) University of Toronto)

TL;DR
This paper develops generalized asymptotic formulas for estimating statistical significance in high energy physics, accommodating complex models with multiple regions and constraints, and validates these formulas with toy data.
Contribution
It introduces new generalized asymptotic expressions for significance estimation that handle complex measurement models in high energy physics.
Findings
Derived closed-form significance formulas under simplifying assumptions
Extended formulas to use the Asimov dataset for data-free estimates
Validated formulas with toy-based data simulations
Abstract
Within the framework of likelihood-based statistical tests for high energy physics measurements, we derive generalized expressions for estimating the statistical significance of discovery using the asymptotic approximations of Wilks and Wald for a variety of measurement models. These models include arbitrary numbers of signal regions, control regions, and Gaussian constraints. We extend our expressions to use the representative or "Asimov" dataset proposed by Cowan et al. such that they are made data-free. While many of the generalized expressions are complicated and often involve solving systems of coupled, multivariate equations, we show these expressions reduce to closed-form results under simplifying assumptions. We also validate the predicted significance using toy-based data in select cases.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Particle physics theoretical and experimental studies · Gaussian Processes and Bayesian Inference
