Keeping Track of User Steering Actions in Dynamic Workflows
Renan Souza (UFRJ), V\'itor Silva (UFRJ), Jos\'e Camata (UFIJ), Alvaro, Coutinho (UFRJ), Patrick Valduriez (ZENITH), Marta Mattoso (UFRJ)

TL;DR
This paper presents a lightweight, real-time provenance management system for tracking user steering actions in long-running scientific workflows, enabling better analysis and tuning with minimal overhead.
Contribution
It introduces a novel online provenance capture method specifically for user-driven parameter tuning in dynamic workflows, with negligible performance impact.
Findings
Overhead for tracking steering actions is less than 1% of total execution time.
The system enables detailed analysis of tuning effects on performance and accuracy.
Validated in a real Oil and Gas industry workflow.
Abstract
In long-lasting scientific workflow executions in HPC machines, computational scientists (the users in this work) often need to fine-tune several workflow parameters. These tunings are done through user steering actions that may significantly improve performance (e.g., reduce execution time) or improve the overall results. However, in executions that last for weeks, users can lose track of what has been adapted if the tunings are not properly registered. In this work, we build on provenance data management to address the problem of tracking online parameter fine-tuning in dynamic workflows steered by users. We propose a lightweight solution to capture and manage provenance of the steering actions online with negligible overhead. The resulting provenance database relates tuning data with data for domain, dataflow provenance, execution, and performance, and is available for analysis at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
