IVOA Provenance data model: hints from the CTA Provenance prototype

Mich\`ele Sanguillon; Mathieu Servillat; Mireille Louys; Fran\c{c}ois; Bonnarel; Catherine Boisson; Johan Br\'egeon

arXiv:1601.02491·astro-ph.IM·January 12, 2016·2 cites

IVOA Provenance data model: hints from the CTA Provenance prototype

Mich\`ele Sanguillon, Mathieu Servillat, Mireille Louys, Fran\c{c}ois, Bonnarel, Catherine Boisson, Johan Br\'egeon

PDF

Open Access

TL;DR

This paper develops an IVOA Provenance data model based on W3C PROV concepts, using the Cherenkov Telescope Array as a test case to improve provenance tracking in high-energy astrophysics data processing.

Contribution

It introduces a new IVOA Provenance data model tailored for Cherenkov astronomy, integrating W3C PROV standards and demonstrating its application in CTA data pipelines.

Findings

01

Provenance information is essential for interpreting high-level data products.

02

The proposed model effectively captures computational provenance in CTA workflows.

03

W3C PROV notations facilitate standardized provenance representation.

Abstract

We present the last developments on the IVOA Provenance data model, mainly based on the W3C PROV concept. In the context of the Cherenkov astronomy, the data processing stages imply both assumptions and comparison to dedicated simulations. As a consequence, Provenance information is crucial to the end user in order to interpret the high level data products. The Cherenkov Telescope Array (CTA), currently in preparation, is thus a perfect test case for the development of an IVOA standard on Provenance information. We describe general use-cases for the computational Provenance in the CTA production pipeline and explore the proposed W3C notations like PROV-N formats, as well as Provenance access solutions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Environmental Monitoring and Data Management · Distributed and Parallel Computing Systems