Adaptive Data Reduction Workflows for Astronomy -- The ESO Data Processing System (EDPS)
Wolfram Freudling, Stefano Zampieri, Lodovico Coccato, Stanislaw, Podgorski, Martino Romaniello, Andrea Modigliani, John Pritchard

TL;DR
This paper introduces the ESO Data Processing System (EDPS), an automated workflow generation system for astronomical data reduction that simplifies pipeline management and enhances flexibility.
Contribution
The paper presents a formalized, Python-based system that automatically derives processing workflows from a single pipeline specification, streamlining data reduction at ESO.
Findings
Successfully derived workflows for ESO's quality control and archive production
Demonstrated system's ability to replace multiple existing data reduction systems
Automated workflow generation improves efficiency and consistency
Abstract
Astronomical data reduction is usually done with processing pipelines that consist of a series of individual processing steps that can be executed stand-alone. These processing steps are then strung together into workflows and fed with data to address a particular processing goal. In this paper, we propose a data processing system that automatically derives processing workflows for different use cases from a single specification of a cascade of processing steps. The system works by using formalized descriptions of data processing pipelines that specify the input and output of each processing step. Inputs can be existing data or the output of a previous step. Rules to select the most appropriate input data are directly attached to the description. A version of the proposed system has been implemented as the ESO Data Processing System (EDPS) in the Python language. The specification of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Astronomy and Astrophysical Research · Distributed and Parallel Computing Systems
