# Constraining Cosmology with Big Data Statistics of Cosmological Graphs

**Authors:** Sungryong Hong, Donghui Jeong, Ho Seong Hwang, Juhan Kim, Sungwook E., Hong, Changbom Park, Arjun Dey, Milos Milosavljevic, Karl Gebhardt and, Kyoung-Soo Lee

arXiv: 1903.07626 · 2020-03-04

## TL;DR

This paper demonstrates how large-scale graph analysis using Big Data tools can effectively distinguish different cosmological models by analyzing the topological structure of gravitational clustering in simulations.

## Contribution

It introduces a novel application of Big Data graph analytics to cosmology, identifying simple topological measures that discriminate among various universe models.

## Key findings

- Three graph-topological measures effectively differentiate cosmological models.
- Big Data infrastructure enables analysis of up to 200 million data points.
- Graph statistics relate directly to traditional correlation functions in cosmology.

## Abstract

By utilizing large-scale graph analytic tools implemented in the modern Big Data platform, Apache Spark, we investigate the topological structure of gravitational clustering in five different universes produced by cosmological $N$-body simulations with varying parameters: (1) a WMAP 5-year compatible $\Lambda$CDM cosmology, (2) two different dark energy equation of state variants, and (3) two different cosmic matter density variants. For the Big Data calculations, we use a custom build of stand-alone Spark/Hadoop cluster at Korea Institute for Advanced Study (KIAS) and Dataproc Compute Engine in Google Cloud Platform (GCP) with the sample size ranging from 7 millions to 200 millions. We find that among the many possible graph-topological measures, three simple ones: (1) the average of number of neighbors (the so-called average vertex degree) $\alpha$, (2) closed-to-connected triple fraction (the so-called transitivity) $\tau_\Delta$, and (3) the cumulative number density $n_{s\ge5}$ of subcomponents with connected component size $s \ge 5$, can effectively discriminate among the five model universes. Since these graph-topological measures are in direct relation with the usual $n$-points correlation functions of the cosmic density field, graph-topological statistics powered by Big Data computational infrastructure opens a new, intuitive, and computationally efficient window into the dark Universe.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.07626/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1903.07626/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/1903.07626/full.md

---
Source: https://tomesphere.com/paper/1903.07626