PowerWalk: Scalable Personalized PageRank via Random Walks with Vertex-Centric Decomposition
Qin Liu, Zhenguo Li, John C.S. Lui, Jiefeng Cheng

TL;DR
PowerWalk introduces a scalable distributed framework for Personalized PageRank that balances offline fingerprinting with online query efficiency, outperforming existing methods on large graphs.
Contribution
The paper presents a novel distributed approach using random walks and vertex-centric decomposition to efficiently compute personalized PageRank on large graphs.
Findings
Achieves exponential convergence with fewer random walks.
Provides highly accurate PPR vectors using more random walks.
Outperforms state-of-the-art algorithms by orders of magnitude.
Abstract
Most methods for Personalized PageRank (PPR) precompute and store all accurate PPR vectors, and at query time, return the ones of interest directly. However, the storage and computation of all accurate PPR vectors can be prohibitive for large graphs, especially in caching them in memory for real-time online querying. In this paper, we propose a distributed framework that strikes a better balance between offline indexing and online querying. The offline indexing attains a fingerprint of the PPR vector of each vertex by performing billions of "short" random walks in parallel across a cluster of machines. We prove that our indexing method has an exponential convergence, achieving the same precision with previous methods using a much smaller number of random walks. At query time, the new PPR vector is composed by a linear combination of related fingerprints, in a highly efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
