Improving ALMA's data processing effciency using a holistic approach
Theodoros Nakos, Harold Francke, Kouichiro Nakanishi, Dirk Petry,, Thomas Stanke, Catarina Ubach, Luciano Cerrigone, Erica Keller, Alfonso, Trejo, Junko Ueda

TL;DR
This paper discusses ALMA's efforts to enhance data processing efficiency through a comprehensive approach, addressing operational challenges, optimization strategies, and future plans to handle increasing data volumes effectively.
Contribution
It introduces a holistic methodology for improving ALMA's data processing efficiency, including operational risk mitigation and strategic optimization initiatives.
Findings
Implemented strategies reduced processing backlog
Optimized workflows increased data throughput
Future plans aim to process 80% of datasets with fewer staff
Abstract
ALMA (Atacama Large Millimeter/submillimeter Array) is the world's largest ground-based facility for observations in the millimeter/submillimeter regime. One of ALMA's outstanding characteristics is the large effort dedicated to the quality assurance (QA) of the calibrated and imaged data products offered to the astronomical community. The Data Management Group (DMG), in charge of the data processing, review, and delivery of the ALMA data, consists of approximately 60 experts in data reduction, from the ALMA Regional Centers (ARCs) and the Joint ALMA Observatory (JAO), distributed in fourteen countries. With a throughput of more than 3,000 datasets per year, meeting the goal of delivering the pipeline-able data products within 30 days after data acquisition is a huge challenge. This paper presents (a) the history of data processing at ALMA, (b) the challenges our team had and is still…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
