Data Compression and Inference in Cosmology with Self-Supervised Machine Learning
Aizhan Akhmetzhanova, Siddharth Mishra-Sharma, Cora Dvorkin

TL;DR
This paper presents a self-supervised machine learning approach for compressing large cosmological datasets into informative summaries that facilitate accurate parameter inference and are robust against systematic effects.
Contribution
It introduces a novel self-supervised learning method for data compression in cosmology, enabling efficient data summarization and robust inference from simulations.
Findings
Achieves highly informative data summaries for cosmological simulations
Enables parameter inference with minimal information loss
Constructs summaries insensitive to systematic effects like baryonic physics
Abstract
The influx of massive amounts of data from current and upcoming cosmological surveys necessitates compression schemes that can efficiently summarize the data with minimal loss of information. We introduce a method that leverages the paradigm of self-supervised machine learning in a novel manner to construct representative summaries of massive datasets using simulation-based augmentations. Deploying the method on hydrodynamical cosmological simulations, we show that it can deliver highly informative summaries, which can be used for a variety of downstream tasks, including precise and accurate parameter inference. We demonstrate how this paradigm can be used to construct summary representations that are insensitive to prescribed systematic effects, such as the influence of baryonic physics. Our results indicate that self-supervised machine learning techniques offer a promising new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Algorithms and Data Compression · Advanced Data Storage Technologies
