A Topological Perspective on Demystifying GNN-Based Link Prediction Performance
Yu Wang, Tong Zhao, Yuying Zhao, Yunchao Liu, Xueqi Cheng, Neil Shah,, Tyler Derr

TL;DR
This paper introduces Topological Concentration (TC), a new metric based on local topology, to better understand and improve GNN-based link prediction performance, especially for low-degree nodes, and proposes scalable approximations and enhancement strategies.
Contribution
It proposes Topological Concentration (TC) as a novel metric for node performance prediction in GNNs, introduces an efficient approximation (ATC), and explores methods to boost link prediction by enhancing TC.
Findings
TC correlates better with LP performance than degree or density.
A topological distribution shift affects newly joined neighbors.
Enhancing TC can improve LP performance, with some limitations.
Abstract
Graph Neural Networks (GNNs) have shown great promise in learning node embeddings for link prediction (LP). While numerous studies aim to improve the overall LP performance of GNNs, none have explored its varying performance across different nodes and its underlying reasons. To this end, we aim to demystify which nodes will perform better from the perspective of their local topology. Despite the widespread belief that low-degree nodes exhibit poorer LP performance, our empirical findings provide nuances to this viewpoint and prompt us to propose a better metric, Topological Concentration (TC), based on the intersection of the local subgraph of each node with the ones of its neighbors. We empirically demonstrate that TC has a higher correlation with LP performance than other node-level topological metrics like degree and subgraph density, offering a better way to identify low-performing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Topological and Geometric Data Analysis
MethodsNone
