Graphlets correct for the topological information missed by random walks
Sam F. L. Windels, Noel Malod-Dognin, Natasa Przulj

TL;DR
This paper introduces orbit adjacencies to explicitly capture topological neighborhood information in networks, improving network embedding performance over traditional random walk methods by including topological details missed by random walks.
Contribution
The authors develop a novel orbit adjacency framework and an efficient counter (GRADCO) to quantify all orbit adjacencies up to four-node graphlets, enhancing network analysis.
Findings
Orbit adjacencies outperform random walk-based methods.
GRADCO efficiently computes all orbit adjacency matrices.
Including topological neighborhood info improves node-label prediction.
Abstract
Random walks are widely used for mining networks due to the computational efficiency of computing them. For instance, graph representation learning learns a d-dimensional embedding space, so that the nodes that tend to co-occur on random walks (a proxy of being in the same network neighborhood) are close in the embedding space. Specific local network topology (i.e., structure) influences the co-occurrence of nodes on random walks, so random walks of limited length capture only partial topological information, hence diminishing the performance of downstream methods. We explicitly capture all topological neighborhood information and improve performance by introducing orbit adjacencies that quantify the adjacencies of two nodes as co-occurring on a given pair of graphlet orbits, which are symmetric positions on graphlets (small, connected, non-isomorphic, induced subgraphs of a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis
