When Provenance Aids and Complicates Reproducibility Judgments

David Koop

arXiv:2308.06894·cs.HC·August 15, 2023

When Provenance Aids and Complicates Reproducibility Judgments

David Koop

PDF

Open Access

TL;DR

This paper examines how data and interaction provenance influence reproducibility judgments in visualization, highlighting both their benefits and limitations in conveying meaningful insights.

Contribution

It introduces scenarios demonstrating the complex role of provenance in reproducibility and discusses ways to better relate provenance to visualizations.

Findings

01

Provenance can both aid and hinder reproducibility assessments.

02

Interaction and insight provenance provide context but do not guarantee reproducibility.

03

Provenance's utility depends on how well it captures meaningful aspects of visualization processes.

Abstract

It is well-established that the provenance of a scientific result is important, sometimes more important than the actual result. For computational analyses that involve visualization, this provenance information may contain the steps involved in generating visualizations from raw data. Specifically, data provenance tracks the lineage of data and process provenance tracks the steps executed. In this paper, we argue that the utility of computational provenance may not be as clear-cut as we might like. One common use case for provenance is that the information can be used to reproduce the original result. However, in visualization, the goal is often to communicate results to a user or viewer, and thus the insights obtained are ultimately most important. Viewers can miss important changes or react to unimportant ones. Here, interaction provenance, which tracks a user's actions with a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Data Visualization and Analytics · Cell Image Analysis Techniques