Probabilistic cosmic web classification using fast-generated training data
Brandon Buncher, Matias Carrasco Kind

TL;DR
This paper introduces a fast, supervised machine learning method for probabilistic classification of cosmic web structures in three dimensions, using simplified training data to achieve accurate results comparable to more complex simulations.
Contribution
It presents a novel, scalable approach that uses a simplified toy model for training, enabling rapid and accurate cosmic web classification without reliance on detailed physical modeling.
Findings
Accurate classification of cosmic web particles using local density and directionality features.
The method is scalable and effective across different datasets and environments.
Training on simplified data does not compromise classification accuracy.
Abstract
We present a novel method of robust probabilistic cosmic web particle classification in three dimensions using a supervised machine learning algorithm. Training data was generated using a simplified CDM toy model with pre-determined algorithms for generating halos, filaments, and voids. While this framework is not constrained by physical modeling, it can be generated substantially more quickly than an N-body simulation without loss in classification accuracy. For each particle in this dataset, measurements were taken of the local density field magnitude and directionality. These measurements were used to train a random forest algorithm, which was used to assign class probabilities to each particle in a CDM, dark matter-only N-body simulation with particles, as well as on another toy model data set. By comparing the trends in the ROC curves and other statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
