Coverage is not enough: Frequentist tests of simulation-based inference for primordial non-Gaussianity
Toka Alokda, Cristiano Porciani, Alexander Eggemeier

TL;DR
This paper evaluates the reliability of simulation-based inference (SBI) for cosmological parameters, revealing limitations of coverage diagnostics and highlighting the importance of posterior shape validation beyond standard tests.
Contribution
It compares SBI with likelihood-based inference for primordial non-Gaussianity, demonstrating discrepancies in posterior tails and emphasizing the need for improved validation strategies.
Findings
SBI and LBI agree on posterior means and skewness.
Discrepancies in kurtosis indicate differences in posterior tails.
Higher-order statistics like WST tighten constraints on $f_\mathrm{NL}$.
Abstract
(Abridged) Simulation-based inference (SBI) has emerged as a powerful framework for extracting cosmological information from complex, non-linear data where analytical likelihoods are unavailable. Its reliability is commonly assessed using coverage-based diagnostics under the prior predictive distribution, which probe calibration only in an averaged sense and do not constrain posterior behavior at fixed parameter value, the regime relevant for practical inference. We investigate these limitations in the context of primordial non-Gaussianity, parameterized by , using simulations of the dark matter halo field. We compare SBI based on contrastive neural ratio estimation (CNRE) with likelihood-based inference (LBI) using the power spectrum, bispectrum, and wavelet scattering transform (WST) coefficients across 1000 realizations. SBI and LBI agree well on posterior means and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
