# Optimizing Communication by Compression for Multi-GPU Scalable   Breadth-First Searches

**Authors:** Julian Romera

arXiv: 1704.00513 · 2017-04-04

## TL;DR

This paper introduces a compression scheme to optimize communication in distributed BFS algorithms on multi-GPU systems, aiming to improve performance by reducing inter-processor communication overhead.

## Contribution

It proposes a novel compression method tailored for GPU-based distributed BFS, addressing communication bottlenecks in parallel graph processing.

## Key findings

- Reduced communication volume in multi-GPU BFS
- Improved scalability of BFS algorithms on distributed systems
- Potential for faster graph analytics in GPU clusters

## Abstract

The Breadth First Search (BFS) algorithm is the foundation and building block of many higher graph-based operations such as spanning trees, shortest paths and betweenness centrality. The importance of this algorithm increases each day due to it is a key requirement for many data structures which are becoming popular nowadays. When the BFS algorithm is parallelized by distributing the graph between several processors the interconnection network limits the performance. Hence, improvements on this area may benefit the overall performance of the algorithm.   This work presents an alternative compression scheme for communications in distributed BFS processing. It focuses on BFS processors using General-Purpose Graphics Processing Units.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.00513/full.md

## Figures

39 figures with captions in the complete paper: https://tomesphere.com/paper/1704.00513/full.md

## References

68 references — full list in the complete paper: https://tomesphere.com/paper/1704.00513/full.md

---
Source: https://tomesphere.com/paper/1704.00513