On combining estimated and analytic covariance matrices
Alan Heavens, Lorne Whiteway, Elena Sellentin

TL;DR
This paper derives an accurate approximation for the combined likelihood function in cosmological data analysis, accounting for both estimated and analytic covariance matrices, by modeling it as a multivariate Student-t distribution.
Contribution
It introduces a new approximation method for the combined likelihood, matching covariance and kurtosis, and provides a fast sampling algorithm based on Student-t distribution properties.
Findings
The combined likelihood can be accurately modeled as a Student-t distribution.
The parameters are set by matching covariance and kurtosis to the true distribution.
A fast sampling algorithm avoids the approximation, based on Student-t mixture representation.
Abstract
The statistical analysis of cosmological data often assumes a Gaussian sampling distribution and relies on covariance matrices estimated from simulations. In this setting, the likelihood function of the data is not Gaussian but is instead a multivariate Student-t distribution, arising from marginalisation over an inverse-Wishart distribution for the true covariance matrix. This framework, introduced by Sellentin & Heavens(2016) and extended by Percival et al.(2022), provides a principled drop-in replacement to the Gaussian likelihood with Hartlap correction (Hartlap et al. 2007). The latter removes bias in the precision matrix; it is still widely used, despite failing to reproduce the heavy tails of the true distribution (thus yielding inaccurate probabilities, especially in the case of tensions between datasets). In practice, cosmological analyses frequently involve additional Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
