Improving the Accuracy of Amortized Model Comparison with Self-Consistency
\v{S}imon Kucharsk\'y, Aayush Mishra, Daniel Habermann, Stefan T. Radev, Paul-Christian B\"urkner

TL;DR
This paper demonstrates that self-consistency training enhances the robustness and accuracy of amortized Bayesian model comparison, especially under model misspecification, by focusing on parameter posteriors rather than model evidence.
Contribution
It investigates how self-consistency improves amortized model comparison, providing practical guidance for more reliable Bayesian inference under misspecification.
Findings
SC training improves robustness with likelihood access under severe misspecification
Parameter posterior-based methods outperform evidence-based approaches
Augmenting methods with SC reduces extrapolation bias in empirical data
Abstract
Amortized Bayesian inference (ABI) offers fast, scalable approximations to posterior densities by training neural surrogates on data simulated from the statistical model. However, ABI methods are highly sensitive to model misspecification: when observed data fall outside the training distribution (generative scope of the statistical models), neural surrogates can behave unpredictably. This makes it a challenge in a model comparison setting, where multiple statistical models are considered, of which at least some are misspecified. Recent work on self-consistency (SC) provides a promising remedy to this issue, accessible even for empirical data (without ground-truth labels). In this work, we investigate how SC can improve amortized model comparison conceptualized in four different ways. Across two synthetic and two real-world case studies, we find that approaches for model comparison that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning
