Extreme Scale Survey Simulation with Python Workflows
A.S. Villarreal, Yadu Babuji, Tom Uram, Daniel S. Katz, Kyle Chard,, Katrin Heitmann

TL;DR
This paper presents an advanced Python-based workflow system that enables large-scale, distributed sky simulations for the LSST, facilitating preparation for upcoming astronomical data analysis and addressing computational challenges at extreme scales.
Contribution
It introduces a scalable, portable workflow framework combining Python, Parsl, and containers for simulating extensive astronomical data sets across multiple supercomputers.
Findings
Successfully simulated five years of observations over 300 sq. degrees.
Efficiently scaled the workflow to 4000 compute nodes across two supercomputers.
Provided insights and lessons learned in large-scale astronomical workflow development.
Abstract
The Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) will soon carry out an unprecedented wide, fast, and deep survey of the sky in multiple optical bands. The data from LSST will open up a new discovery space in astronomy and cosmology, simultaneously providing clues toward addressing burning issues of the day, such as the origin of dark energy and and the nature of dark matter, while at the same time yielding data that will, in turn, pose fresh new questions. To prepare for the imminent arrival of this remarkable data set, it is crucial that the associated scientific communities be able to develop the software needed to analyze it. Computational power now available allows us to generate synthetic data sets that can be used as a realistic training ground for such an effort. This effort raises its own challenges -- the need to generate very large simulations of the night…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems · Computational Physics and Python Applications
