The surprising strength of weak classifiers for validating neural posterior estimates
Vansh Bansal, Tianyu Chen, James G. Scott

TL;DR
This paper demonstrates that weak classifiers, even if overfitted or biased, can reliably validate neural posterior estimates using a conformal C2ST method that controls Type-I error and maintains power.
Contribution
The authors introduce a conformal C2ST approach that converts any trained classifier's scores into valid p-values, enabling reliable validation of neural posterior estimates with weak classifiers.
Findings
Conformal C2ST controls Type-I error with finite samples.
Weak classifiers still provide powerful validation tests.
Empirical results outperform classical discriminative tests.
Abstract
Neural Posterior Estimation (NPE) has emerged as a powerful approach for amortized Bayesian inference when the true posterior is intractable or difficult to sample. But evaluating the accuracy of neural posterior estimates remains challenging, with existing methods suffering from major limitations. One appealing and widely used method is the classifier two-sample test (C2ST), where a classifier is trained to distinguish samples from the true posterior versus the learned NPE approximation . Yet despite the appealing simplicity of the C2ST, its theoretical and practical reliability depend upon having access to a near-Bayes-optimal classifier -- a requirement that is rarely met and, at best, difficult to verify. Thus a major open question is: can a weak classifier still be useful for neural posterior validation? We show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
