Improving the Accuracy of Amortized Model Comparison with Self-Consistency

\v{S}imon Kucharsk\'y; Aayush Mishra; Daniel Habermann; Stefan T. Radev; Paul-Christian B\"urkner

arXiv:2512.14308·stat.ML·January 27, 2026

Improving the Accuracy of Amortized Model Comparison with Self-Consistency

\v{S}imon Kucharsk\'y, Aayush Mishra, Daniel Habermann, Stefan T. Radev, Paul-Christian B\"urkner

PDF

Open Access

TL;DR

This paper demonstrates that self-consistency training enhances the robustness and accuracy of amortized Bayesian model comparison, especially under model misspecification, by focusing on parameter posteriors rather than model evidence.

Contribution

It investigates how self-consistency improves amortized model comparison, providing practical guidance for more reliable Bayesian inference under misspecification.

Findings

01

SC training improves robustness with likelihood access under severe misspecification

02

Parameter posterior-based methods outperform evidence-based approaches

03

Augmenting methods with SC reduces extrapolation bias in empirical data

Abstract

Amortized Bayesian inference (ABI) offers fast, scalable approximations to posterior densities by training neural surrogates on data simulated from the statistical model. However, ABI methods are highly sensitive to model misspecification: when observed data fall outside the training distribution (generative scope of the statistical models), neural surrogates can behave unpredictably. This makes it a challenge in a model comparison setting, where multiple statistical models are considered, of which at least some are misspecified. Recent work on self-consistency (SC) provides a promising remedy to this issue, accessible even for empirical data (without ground-truth labels). In this work, we investigate how SC can improve amortized model comparison conceptualized in four different ways. Across two synthetic and two real-world case studies, we find that approaches for model comparison that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning