Mapping Large Memory-constrained Workflows onto Heterogeneous Platforms
Svetlana Kulagina, Henning Meyerhenke, Anne Benoit

TL;DR
This paper introduces a four-step heuristic for efficiently mapping large, memory-constrained workflows represented as DAGs onto heterogeneous platforms, significantly reducing execution time compared to baseline methods.
Contribution
It presents a novel four-step heuristic that partitions, adapts, and optimizes workflow mapping on heterogeneous platforms considering memory constraints.
Findings
Heuristic reduces makespan by 2.44 times on average.
Effective handling of large workflows with up to 30,000 tasks.
Significant improvements over baseline methods ignoring heterogeneity.
Abstract
Scientific workflows are often represented as directed acyclic graphs (DAGs), where vertices correspond to tasks and edges represent the dependencies between them. Since these graphs are often large in both the number of tasks and their resource requirements, it is important to schedule them efficiently on parallel or distributed compute systems. Typically, each task requires a certain amount of memory to be executed and needs to communicate data to its successor tasks. The goal is thus to execute the workflow as fast as possible (i.e., to minimize its makespan) while satisfying the memory constraints. Hence, we investigate the partitioning and mapping of DAG-shaped workflows onto heterogeneous platforms where each processor can have a different speed and a different memory size. We first propose a baseline algorithm in the absence of existing memory-aware solutions. As our main…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
