Scalable and Provable Kemeny Constant Computation on Static and Dynamic Graphs: A 2-Forest Sampling Approach
Cheng Li, Meihao Liao, Rong-Hua Li, Guoren Wang

TL;DR
This paper introduces a novel 2-forest sampling method for efficiently and accurately approximating the Kemeny constant on large static and dynamic graphs, with strong theoretical guarantees and practical performance.
Contribution
It presents a new unbiased estimator based on 2-forest sampling, along with algorithms for efficient sampling and dynamic updates, improving over existing random walk-based methods.
Findings
Outperforms state-of-the-art methods in efficiency and accuracy
Provides near-linear time complexity for approximation
Effective on large real-world static and dynamic graphs
Abstract
Kemeny constant, defined as the expected hitting time of random walks from a source node to a randomly chosen target node, is a fundamental metric in graph data management with many real-world applications. However, computing it exactly on large graphs is highly challenging, as it requires inverting large graph matrices. Existing solutions mainly rely on approximate random-walk-based methods, which still need large sample sizes and lack strong theoretical guarantees. In this paper, we propose a new approach for approximating the Kemeny constant via 2-forest sampling. We first derive an unbiased estimator expressed through spanning trees by introducing a path mapping technique that establishes a direct correspondence between spanning trees and certain classes of 2-forests. Compared to random walk-based estimators, 2-forest-based estimators yield leads to a better theoretical bound. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Complex Network Analysis Techniques
