New Approximation Algorithms for Forest Closeness Centrality -- for Individual Vertices and Vertex Groups
Alexander van der Grinten, Eugenio Angriman, Maria Predari, Henning, Meyerhenke

TL;DR
This paper introduces a faster, more accurate approximation algorithm for forest closeness centrality in large graphs and explores the NP-hard problem of finding optimal vertex groups, demonstrating its effectiveness in classification tasks.
Contribution
It presents a nearly-linear time approximation algorithm for forest closeness centrality with probabilistic error guarantees and introduces the first approach to find optimal vertex groups for this measure.
Findings
The new algorithm is up to two orders of magnitude faster and more accurate.
Group forest closeness outperforms existing measures in semi-supervised classification.
The group vertex selection problem is NP-hard, with a proposed approximation method.
Abstract
The emergence of massive graph data sets requires fast mining algorithms. Centrality measures to identify important vertices belong to the most popular analysis methods in graph mining. A measure that is gaining attention is forest closeness centrality; it is closely related to electrical measures using current flow but can also handle disconnected graphs. Recently, [Jin et al., ICDM'19] proposed an algorithm to approximate this measure probabilistically. Their algorithm processes small inputs quickly, but does not scale well beyond hundreds of thousands of vertices. In this paper, we first propose a different approximation algorithm; it is up to two orders of magnitude faster and more accurate in practice. Our method exploits the strong connection between uniform spanning trees and forest distances by adapting and extending recent approximation algorithms for related single-vertex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
