The Sum-over-Forests density index: identifying dense regions in a graph
Mathieu Senelle, Silvia Garcia-Diez, Amin Mantrach, Masashi Shimbo,, Marco Saerens, Fran\c{c}ois Fouss

TL;DR
The paper introduces the Sum-over-Forests (SoF) density index, a nonparametric method for identifying dense regions in graphs based on probabilistic forest models and matrix computations.
Contribution
It presents a novel density index for graphs that leverages a probabilistic forest framework and can be computed efficiently via matrix inversion.
Findings
Effective in detecting dense regions in artificial and real graphs.
Performs well across various types of graph data.
Provides a closed-form computation method.
Abstract
This work introduces a novel nonparametric density index defined on graphs, the Sum-over-Forests (SoF) density index. It is based on a clear and intuitive idea: high-density regions in a graph are characterized by the fact that they contain a large amount of low-cost trees with high outdegrees while low-density regions contain few ones. Therefore, a Boltzmann probability distribution on the countable set of forests in the graph is defined so that large (high-cost) forests occur with a low probability while short (low-cost) forests occur with a high probability. Then, the SoF density index of a node is defined as the expected outdegree of this node in a non-trivial tree of the forest, thus providing a measure of density around that node. Following the matrix-forest theorem, and a statistical physics framework, it is shown that the SoF density index can be easily computed in closed form…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Topological and Geometric Data Analysis · Data Visualization and Analytics
