Massive data compression for parameter-dependent covariance matrices
Alan Heavens, Elena Sellentin, Damien de Mijolla, Alvise Vianello

TL;DR
This paper demonstrates how the MOPED data compression algorithm can drastically reduce the number of simulations needed to estimate parameter-dependent covariance matrices in large-scale cosmological data analyses, enabling feasible analysis for upcoming surveys.
Contribution
It introduces the application of MOPED to efficiently compress data for covariance matrix estimation in parameter-dependent scenarios, significantly reducing computational costs.
Findings
MOPED reduces simulation requirements by a factor of 1000.
Assuming regularity in covariance matrices can reduce simulations by a factor of 10^6.
Enables feasible analysis of large cosmological datasets with limited simulations.
Abstract
We show how the massive data compression algorithm MOPED can be used to reduce, by orders of magnitude, the number of simulated datasets that are required to estimate the covariance matrix required for the analysis of gaussian-distributed data. This is relevant when the covariance matrix cannot be calculated directly. The compression is especially valuable when the covariance matrix varies with the model parameters. In this case, it may be prohibitively expensive to run enough simulations to estimate the full covariance matrix throughout the parameter space. This compression may be particularly valuable for the next-generation of weak lensing surveys, such as proposed for Euclid and LSST, for which the number of summary data (such as band power or shear correlation estimates) is very large, , due to the large number of tomographic redshift bins that the data will be divided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
