Parallel HOP: A Scalable Halo Finder for Massive Cosmological Data Sets
Stephen Skory (1), Matthew J. Turk (1), Michael L. Norman (1), and, Alison L. Coil (1) ((1) Center for Astrophysics, Space Sciences,, University of California, San Diego)

TL;DR
Parallel HOP is a scalable, MPI-based halo finder for massive cosmological datasets, enabling efficient analysis of billion-particle simulations by distributing the workload across multiple compute nodes.
Contribution
It introduces a parallel implementation of the HOP halo finder using MPI and domain decomposition, allowing analysis of larger datasets than previous serial or parallel versions.
Findings
Scales well up to several hundred tasks
Handles datasets with over 2000^3 particles
Integrated into the yt analysis toolkit
Abstract
Modern N-body cosmological simulations contain billions () of dark matter particles. These simulations require hundreds to thousands of gigabytes of memory, and employ hundreds to tens of thousands of processing cores on many compute nodes. In order to study the distribution of dark matter in a cosmological simulation, the dark matter halos must be identified using a halo finder, which establishes the halo membership of every particle in the simulation. The resources required for halo finding are similar to the requirements for the simulation itself. In particular, simulations have become too extensive to use commonly-employed halo finders, such that the computational requirements to identify halos must now be spread across multiple nodes and cores. Here we present a scalable-parallel halo finding method called Parallel HOP for large-scale cosmological simulation data. Based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed and Parallel Computing Systems · Simulation Techniques and Applications
