Confidently Comparing Estimators with the c-value
Brian L. Trippe, Sameer K. Deshpande, Tamara Broderick

TL;DR
The paper introduces the c-value, a confidence measure for comparing estimators' performance, providing a practical alternative to traditional risk-based comparisons, with applications in Bayesian and hierarchical models.
Contribution
It proposes the c-value as a new frequentist metric for estimator comparison, including methods for its computation and validation in complex models.
Findings
c-value effectively indicates when a new estimator outperforms an old one
Large c-value correlates with smaller loss for the new estimate
Method applicable to hierarchical models and Gaussian processes
Abstract
Modern statistics provides an ever-expanding toolkit for estimating unknown parameters. Consequently, applied statisticians frequently face a difficult decision: retain a parameter estimate from a familiar method or replace it with an estimate from a newer or more complex one. While it is traditional to compare estimates using risk, such comparisons are rarely conclusive in realistic settings. In response, we propose the "c-value" as a measure of confidence that a new estimate achieves smaller loss than an old estimate on a given dataset. We show that it is unlikely that a large c-value coincides with a larger loss for the new estimate. Therefore, just as a small p-value supports rejecting a null hypothesis, a large c-value supports using a new estimate in place of the old. For a wide class of problems and estimates, we show how to compute a c-value by first constructing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
