Enhanced Inversion of Schema Evolution with Provenance
Tanja Auge, Andreas Heuer

TL;DR
This paper enhances the inversion of schema evolution by defining operators and provenance requirements, enabling better reconstruction and reproducibility of evolving scientific databases over long periods.
Contribution
It classifies schema modification operators, defines their inverses with provenance considerations, and extends schema mapping theory to improve reproducibility in evolving databases.
Findings
Provenance information enables near-complete reconstruction of research data.
Defined four classes of schema evolution scenarios with representative methods.
Extended schema mapping theory to incorporate provenance for better data reproducibility.
Abstract
Long-term data-driven studies have become indispensable in many areas of science. Often, the data formats, structures and semantics of data change over time, the data sets evolve. Therefore, studies over several decades in particular have to consider changing database schemas. The evolution of these databases lead at some point to a large number of schemas, which have to be stored and managed, costly and time-consuming. However, in the sense of reproducibility of research data each database version must be reconstructable with little effort. So a previously published result can be validated and reproduced at any time. Nevertheless, in many cases, such an evolution can not be fully reconstructed. This article classifies the 15 most frequently used schema modification operators and defines the associated inverses for each operation. For avoiding an information loss, it furthermore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Scientific Computing and Data Management · Semantic Web and Ontologies
