The design strategy of scientific data quality control software for Euclid mission
Massimo Brescia, Stefano Cavuoti, Terje Fredvik, Stein Vidar Hagfors Haugan, Ghassem Gozaliasl, Charles Kirkpatrick, Hannu Kurki-Suonio, Giuseppe Longo, Kari Nilsson, Martin Wiesmann

TL;DR
This paper discusses the design strategy for a comprehensive data quality control software system for the Euclid space mission, emphasizing automation, standardization, and integration within a large-scale, parallel data processing pipeline.
Contribution
It introduces a novel design approach for the Data Quality Common Tools tailored to the Euclid mission's complex, parallel pipeline environment, ensuring coherent quality evaluation across all stages.
Findings
Design strategy emphasizes automation and standardization.
Tools are integrated across pipeline stages for efficiency.
Focus on maintaining data quality in large-scale, parallel processing.
Abstract
The most valuable asset of a space mission like Euclid is its data. Because of the huge data volume, automatic quality control becomes crucial over the entire lifetime of the experiment. Here we focus on the design strategy for the Science Ground Segment (SGS) Data Quality Common Tools (DQCT), whose main role is to provide software solutions to gather, evaluate, and record quality information about the raw and derived data products from a primarily scientific perspective. The SGS DQCT will provide a quantitative basis for evaluating the application of reduction and calibration reference data, as well as diagnostic tools for quality parameters, flags, trend analysis diagrams and any other metadata parameter produced by the pipeline. In a large programme like Euclid, it is prohibitively expensive to process large amounts of data at the pixel level just for the purpose of…
Taxonomy
Topics: Distributed and Parallel Computing Systems · Data Quality and Management
