TL;DR
This paper introduces a novel MPI-based parallel algorithm for mapping complex, large-scale graphs onto hierarchical distributed systems, integrating partitioning and mapping to improve scalability and quality.
Contribution
It presents the first public implementation of a parallel graph mapping algorithm that models system hierarchy as a labeled tree, enhancing scalability and mapping quality.
Findings
Achieves good scalability on thousands of PEs.
Outperforms other MPI-based mapping tools in speed and quality.
Provides better mapping quality than some non-parallel tools.
Abstract
Processing massive application graphs on distributed memory systems requires to map the graphs onto the system's processing elements (PEs). This task becomes all the more important when PEs have non-uniform communication costs or the input is highly irregular. Typically, mapping is addressed using partitioning, in a two-step approach or an integrated one. Parallel partitioning tools do exist; yet, corresponding mapping algorithms or their public implementations all have major sequential parts or other severe scaling limitations. In this paper, we propose a parallel algorithm that maps graphs onto the PEs of a hierarchical system. Our solution integrates partitioning and mapping; it models the system hierarchy in a concise way as an implicit labeled tree. The vertices of the application graph are labeled as well, and these vertex labels induce the mapping. The mapping optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
