Distributing Sparse Matrix/Graph Applications in Heterogeneous Clusters -- an Experimental Study
Charilaos Tzovas, Maria Predari, Henning Meyerhenke

TL;DR
This paper investigates how to effectively distribute sparse matrix and graph workloads across heterogeneous cluster systems with CPUs and GPUs, proposing a new approach and evaluating existing tools for optimal load balancing.
Contribution
It formulates the load distribution problem for heterogeneous architectures as a multi-objective optimization and evaluates existing partitioners, including a new extension called Geographer.
Findings
Parmatis and Geographer produce good quality partitions.
Geographer yields better quality than Parmatis on average.
Only two tools handle the heterogeneous distribution problem effectively.
Abstract
Many problems in scientific and engineering applications contain sparse matrices or graphs as main input objects, e.g. numerical simulations on meshes. Large inputs are abundant these days and require parallel processing for memory size and speed. To optimize the execution of such simulations on cluster systems, the input problem needs to be distributed suitably onto the processing units (PUs). More and more frequently, such clusters contain different CPUs or a combination of CPUs and GPUs. This heterogeneity makes the load distribution problem quite challenging. Our study is motivated by the observation that established partitioning tools do not handle such heterogeneous distribution problems as well as homogeneous ones. In this paper, we first formulate the problem of balanced load distribution for heterogeneous architectures as a multi-objective, single-constraint optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
