# Walking Randomly, Massively, and Efficiently

**Authors:** Jakub {\L}\k{a}cki, Slobodan Mitrovi\'c, Krzysztof Onak, Piotr, Sankowski

arXiv: 1907.05391 · 2019-11-07

## TL;DR

This paper presents new techniques for efficiently generating multiple independent random walks in the MPC model, breaking previous round complexity barriers for key graph problems like PageRank, bipartiteness, and expansion testing.

## Contribution

It introduces a novel approach to generate many independent random walks in sublinear space per machine, achieving near-optimal round complexity for PageRank and related problems in the MPC model.

## Key findings

- Random walks of length l can be generated in Θ(log l) rounds.
- The approach breaks the Θ(log n) round barrier for PageRank in MPC.
- Round complexity is asymptotically tight under the 1-vs.-2-Cycles conjecture.

## Abstract

We introduce a set of techniques that allow for efficiently generating many independent random walks in the Massive Parallel Computation (MPC) model with space per machine strongly sublinear in the number of vertices. In this space-per-machine regime, many natural approaches to graph problems struggle to overcome the $\Theta(\log n)$ MPC round complexity barrier. Our techniques enable breaking this barrier for PageRank---one of the most important applications of random walks---even in more challenging directed graphs, and for approximate bipartiteness and expansion testing.   In the undirected case, we start our random walks from the stationary distribution, which implies that we approximately know the empirical distribution of their next steps. This allows for preparing continuations of random walks in advance and applying a doubling approach. As a result we can generate multiple random walks of length $l$ in $\Theta(\log l)$ rounds on MPC. Moreover, we show that under the popular 1-vs.-2-Cycles conjecture, this round complexity is asymptotically tight.   For directed graphs, our approach stems from our treatment of the PageRank Markov chain. We first compute the PageRank for the undirected version of the input graph and then slowly transition towards the directed case, considering convex combinations of the transition matrices in the process.   For PageRank, we achieve the following round complexities for damping factor equal to $1 - \epsilon$:   * in $O(\log \log n + \log 1 / \epsilon)$ rounds for undirected graphs (with $\tilde O(m / \epsilon^2)$ total space),   * in $\tilde O(\log^2 \log n + \log^2 1/\epsilon)$ rounds for directed graphs (with $\tilde O((m+n^{1+o(1)}) / poly\, \epsilon)$ total space).

---
Source: https://tomesphere.com/paper/1907.05391