Near-Optimal Two-Pass Streaming Algorithm for Sampling Random Walks over Directed Graphs
Lijie Chen, Gillat Kol, Dmitry Paramonov, Raghuvansh Saxena, Zhao, Song, Huacheng Yu

TL;DR
This paper establishes the optimal space complexity for two-pass streaming algorithms to sample random walks in directed graphs, significantly improving previous bounds and extending results to multiple passes.
Contribution
It provides tight bounds on the space complexity of two-pass algorithms for random walk sampling, and extends the lower bounds to any constant number of passes.
Findings
Two-pass algorithms require (n L) space.
Lower bounds apply to any constant number of passes, requiring (n L^{1/p}) space.
Results have implications for efficient PageRank approximation and related graph algorithms.
Abstract
For a directed graph with vertices and a start vertex , we wish to (approximately) sample an -step random walk over starting from with minimum space using an algorithm that only makes few passes over the edges of the graph. This problem found many applications, for instance, in approximating the PageRank of a webpage. If only a single pass is allowed, the space complexity of this problem was shown to be . Prior to our work, a better space complexity was only known with passes. We settle the space complexity of this random walk simulation problem for two-pass streaming algorithms, showing that it is , by giving almost matching upper and lower bounds. Our lower bound argument extends to every constant number of passes , and shows that any -pass algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
