Leveraging Self-Consistency for Data-Efficient Amortized Bayesian Inference
Marvin Schmitt, Desi R. Ivanova, Daniel Habermann, Ullrich K\"othe,, Paul-Christian B\"urkner, Stefan T. Radev

TL;DR
This paper introduces a self-consistency loss to enhance data efficiency and accuracy in amortized Bayesian inference by exploiting symmetries in the joint model, improving neural density estimation especially in low data scenarios.
Contribution
It proposes a novel self-consistency loss that penalizes symmetry violations, leading to better approximate inference in Bayesian models, applicable to neural posterior and likelihood estimation.
Findings
Improved inference accuracy in synthetic and scientific models
Enhanced neural density estimator training with self-consistency loss
Significant benefits in low data regimes
Abstract
We propose a method to improve the efficiency and accuracy of amortized Bayesian inference by leveraging universal symmetries in the joint probabilistic model of parameters and data. In a nutshell, we invert Bayes' theorem and estimate the marginal likelihood based on approximate representations of the joint model. Upon perfect approximation, the marginal likelihood is constant across all parameter values by definition. However, errors in approximate inference lead to undesirable variance in the marginal likelihood estimates across different parameter values. We penalize violations of this symmetry with a \textit{self-consistency loss} which significantly improves the quality of approximate inference in low data regimes and can be used to augment the training of popular neural density estimators. We apply our method to a number of synthetic problems and realistic scientific models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning · Bayesian Methods and Mixture Models
