Learning Cosmology from Nearest Neighbour Statistics
Atrideb Chatterjee, Arka Banerjee, Francisco Villaescusa-Navarro, and Tom Abel

TL;DR
This paper introduces NN distance maps combined with kNN-based statistics and neural networks to improve the accuracy and efficiency of cosmological parameter inference from galaxy catalogues.
Contribution
It presents a novel field-level representation called NN distance maps and a hybrid neural network approach that outperforms existing methods in accuracy and computational efficiency.
Findings
Achieves state-of-the-art constraints on cosmological parameters.
Hybrid method is 5-10 times more efficient than previous point-cloud ML methods.
Demonstrates the effectiveness of NN distance maps with kNN statistics on Quijote simulations.
Abstract
Extracting cosmological parameters from galaxy/halo catalogues with sub-percent level accuracy is an important aspect of modern cosmology, especially in view of ongoing and upcoming surveys such as Euclid, DESI, and LSST. While traditional two-point statistics have been known to be suboptimal for this task, recently proposed k-Nearest Neighbour (kNN) based summary statistics have demonstrated tighter constraining power. Building on the kNN statistics, we introduce a new field-level representation of discrete halo catalogues - NN distance maps. We employ this technique on the halo catalogues obtained from Quijote N-body simulation suites. By combining these maps with kNN-based summary statistics, we train a hybrid neural network to infer cosmological parameters, showing that the resulting constraints achieve state-of-the-art, if not the best, accuracy. In addition, our hybrid framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGalaxies: Formation, Evolution, Phenomena · Topological and Geometric Data Analysis · Gaussian Processes and Bayesian Inference
