# Better Process Mapping and Sparse Quadratic Assignment

**Authors:** Christian Schulz, Jesper Larsson Tr\"aff, Konrad von Kirchbach

arXiv: 1702.04164 · 2019-07-23

## TL;DR

This paper introduces advanced algorithms for process mapping in parallel systems, leveraging sparse communication patterns and hierarchical system structures to improve solution quality and computational efficiency.

## Contribution

It presents novel multilevel algorithms for initial mapping and local search that exploit application and system hierarchies for better process-to-processor assignments.

## Key findings

- Algorithms significantly speed up local search.
- Multilevel approach finds better mappings in practice.
- Exploiting sparsity and hierarchy improves solution quality.

## Abstract

Communication and topology aware process mapping is a powerful approach to reduce communication time in parallel applications with known communication patterns on large, distributed memory systems. We address the problem as a quadratic assignment problem (QAP), and present algorithms to construct initial mappings of processes to processors, and fast local search algorithms to further improve the mappings. By exploiting assumptions that typically hold for applications and modern supercomputer systems such as sparse communication patterns and hierarchically organized communication systems, we obtain significantly more powerful algorithms for these special QAPs. Our multilevel construction algorithms employ perfectly balanced graph partitioning techniques and exploit the given communication system hierarchy in significant ways. We present improvements to a local search algorithm of Brandfass et al. (2013), and further decrease the running time by reducing the time needed to perform swaps in the assignment as well as by carefully constraining local search neighborhoods. We also investigate different algorithms to create the communication graph that is mapped onto the processor network. Experiments indicate that our algorithms not only dramatically speed up local search, but due to the multilevel approach also find much better solutions in practice.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1702.04164/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/1702.04164/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1702.04164/full.md

---
Source: https://tomesphere.com/paper/1702.04164