Simulation-based inference with deep ensembles: Evaluating calibration uncertainty and detecting model misspecification

James Alvey; Carlo R. Contaldi; Mauro Pieroni

arXiv:2507.13495·astro-ph.IM·January 21, 2026·Mach. Learn. Sci. Technol.

Simulation-based inference with deep ensembles: Evaluating calibration uncertainty and detecting model misspecification

James Alvey, Carlo R. Contaldi, Mauro Pieroni

PDF

TL;DR

This paper introduces an ensemble-based diagnostic method for Simulation-Based Inference (SBI) that assesses the consistency of posterior estimates, helping detect model misspecification and quantify uncertainty without needing the true posterior.

Contribution

The authors propose a simple, scalable ensemble learning diagnostic for SBI that evaluates internal consistency and detects model misspecification without access to the true posterior.

Findings

01

Ensemble KL divergence correlates with convergence issues.

02

The method detects model misspecification effectively.

03

Provides a scalable tool for SBI validation.

Abstract

Simulation-Based Inference (SBI) offers a principled and flexible framework for conducting Bayesian inference in any situation where forward simulations are feasible. However, validating the accuracy and reliability of the inferred posteriors remains a persistent challenge. In this work, we point out a simple diagnostic approach rooted in ensemble learning methods to assess the internal consistency of SBI outputs that does not require access to the true posterior. By training multiple neural estimators under identical conditions and evaluating their pairwise Kullback-Leibler (KL) divergences, we define a consistency criterion that quantifies agreement across the ensemble. We highlight two core use cases for this framework: a) for generating a robust estimate of the systematic uncertainty in parameter reconstruction associated with the training procedure, and b) for detecting possible…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.