Task Runtime Prediction in Scientific Workflows Using an Online Incremental Learning Approach
Muhammad H. Hilman, Maria A. Rodriguez, Rajkumar Buyya

TL;DR
This paper introduces an online incremental learning method for predicting task runtimes in scientific workflows on cloud platforms, utilizing real-time resource monitoring data to enhance accuracy.
Contribution
It presents a novel online learning approach that adaptively predicts task runtimes using time-series resource data, outperforming existing regression-based methods.
Findings
Improves prediction accuracy by up to 29.89% over state-of-the-art methods.
Utilizes fine-grained resource monitoring data for better modeling.
Demonstrates effectiveness in near real-time cloud environments.
Abstract
Many algorithms in workflow scheduling and resource provisioning rely on the performance estimation of tasks to produce a scheduling plan. A profiler that is capable of modeling the execution of tasks and predicting their runtime accurately, therefore, becomes an essential part of any Workflow Management System (WMS). With the emergence of multi-tenant Workflow as a Service (WaaS) platforms that use clouds for deploying scientific workflows, task runtime prediction becomes more challenging because it requires the processing of a significant amount of data in a near real-time scenario while dealing with the performance variability of cloud resources. Hence, relying on methods such as profiling tasks' execution data using basic statistical description (e.g., mean, standard deviation) or batch offline regression techniques to estimate the runtime may not be suitable for such environments.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
