Scaling Betweenness Approximation to Billions of Edges by MPI-based   Adaptive Sampling

Alexander van der Grinten; Henning Meyerhenke

arXiv:1910.11039·cs.DC·October 25, 2019

Scaling Betweenness Approximation to Billions of Edges by MPI-based Adaptive Sampling

Alexander van der Grinten, Henning Meyerhenke

PDF

TL;DR

This paper introduces an MPI-based adaptive sampling algorithm that significantly accelerates betweenness centrality approximation, enabling analysis of billion-edge graphs in under ten minutes with high accuracy.

Contribution

It presents the first MPI-parallelized adaptive sampling algorithm for betweenness approximation, achieving substantial speedups over shared-memory methods on large-scale graphs.

Findings

01

16.1x faster than shared-memory implementation for the sampling phase

02

7.4x average speedup for the complete algorithm

03

Able to approximate betweenness on graphs with billions of edges in under ten minutes

Abstract

Betweenness centrality is one of the most popular vertex centrality measures in network analysis. Hence, many (sequential and parallel) algorithms to compute or approximate betweenness have been devised. Recent algorithmic advances have made it possible to approximate betweenness very efficiently on shared-memory architectures. Yet, the best shared-memory algorithms can still take hours of running time for large graphs, especially for graphs with a high diameter or when a small relative error is required. In this work, we present an MPI-based generalization of the state-of-the-art shared-memory algorithm for betweenness approximation. This algorithm is based on adaptive sampling; our parallelization strategy can be applied in the same manner to adaptive sampling algorithms for other problems. In experiments on a 16-node cluster, our MPI-based implementation is by a factor of 16.1x…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.