Statistically Optimal Uncertainty Quantification for Expensive Black-Box Models
Shengyi He, Henry Lam

TL;DR
This paper develops a theoretical framework for constructing statistically optimal confidence intervals in the context of expensive black-box models, demonstrating that certain batching and resampling methods achieve minimal interval length under computational constraints.
Contribution
It introduces a new theoretical approach to identify the most efficient confidence interval methods for limited model evaluations, including novel formulas with uneven or overlapping batches.
Findings
Standard batching is optimal under constraints.
New batching formulas improve CI efficiency.
Resampling methods like cheap bootstrap are statistically optimal.
Abstract
Uncertainty quantification, by means of confidence interval (CI) construction, has been a fundamental problem in statistics and also important in risk-aware decision-making. In this paper, we revisit the basic problem of CI construction, but in the setting of expensive black-box models. This means we are confined to using a low number of model runs, and without the ability to obtain auxiliary model information such as gradients. In this case, there exist classical methods based on data splitting, and newer methods based on suitable resampling. However, while all these resulting CIs have similarly accurate coverage in large sample, their efficiencies in terms of interval length differ, and a systematic understanding of which method and configuration attains the shortest interval appears open. Motivated by this, we create a theoretical framework to study the statistical optimality on CI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Fault Detection and Control Systems
