Processing of GASKAP-HI pilot survey data using a commercial supercomputer
Ian P. Kemp, Nickolas M. Pingel, Rowan Worth, Justin Wake, Daniel A., Mitchell, Stuart D. Midgely, Steven J. Tingay, James Dempsey, Helga D\'enes,, John M. Dickey, Steven J. Gibson, Kate E. Jameson, Callum Lynn, Yik Ki Ma,, Antoine Marchal, Naomi M. McClure-Griffiths

TL;DR
This paper explores using commercial supercomputers for processing large radio astronomy data, demonstrating a scalable workflow, cost optimization, and highlighting benefits like high availability alongside challenges such as required HPC expertise.
Contribution
It presents a four-step process for porting radio astronomy workflows to commercial HPC, with practical insights and resource estimates for large-scale data processing.
Findings
Commercial HPC offers immediate access and high availability.
Workflow optimization reduces processing costs and time.
Lessons learned inform future large-scale radio astronomy data processing.
Abstract
Modern radio telescopes generate large amounts of data, with the next generation Very Large Array (ngVLA) and the Square Kilometre Array (SKA) expected to feed up to 292 GB of visibilities per second to the science data processor (SDP). However, the continued exponential growth in the power of the world's largest supercomputers suggests that for the foreseeable future there will be sufficient capacity available to provide for astronomers' needs in processing 'science ready' products from the new generation of telescopes, with commercial platforms becoming an option for overflow capacity. The purpose of the current work is to trial the use of commercial high performance computing (HPC) for a large scale processing task in astronomy, in this case processing data from the GASKAP-HI pilot surveys. We delineate a four-step process which can be followed by other researchers wishing to port an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-time simulation and control systems · Advanced Data Processing Techniques
