A bag-of-paths framework for network data analysis
Kevin Fran\c{c}oisse, Ilkka Kivim\"aki, Amin Mantrach, Fabrice Rossi,, Marco Saerens

TL;DR
This paper introduces the bag-of-paths framework, a probabilistic approach to network analysis that computes node relatedness and distances by considering all paths with a Gibbs-Boltzmann distribution, enabling new metrics and applications.
Contribution
The paper presents a novel probabilistic framework for network analysis that generalizes shortest path and resistance distances using all paths with a Gibbs-Boltzmann distribution.
Findings
Derived new node distance measures from BoP probabilities.
Distances interpolate between shortest path and resistance distances.
Experimental results show competitive performance in semi-supervised classification.
Abstract
This work develops a generic framework, called the bag-of-paths (BoP), for link and network data analysis. The central idea is to assign a probability distribution on the set of all paths in a network. More precisely, a Gibbs-Boltzmann distribution is defined over a bag of paths in a network, that is, on a representation that considers all paths independently. We show that, under this distribution, the probability of drawing a path connecting two nodes can easily be computed in closed form by simple matrix inversion. This probability captures a notion of relatedness between nodes of the graph: two nodes are considered as highly related when they are connected by many, preferably low-cost, paths. As an application, two families of distances between nodes are derived from the BoP probabilities. Interestingly, the second distance family interpolates between the shortest path distance and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
