Synthetic graphs for link prediction benchmarking

Alexey Vlaskin; Eduardo G. Altmann

arXiv:2412.03757·cs.SI·February 14, 2025

Synthetic graphs for link prediction benchmarking

Alexey Vlaskin, Eduardo G. Altmann

PDF

1 Repo

TL;DR

This paper introduces a family of synthetic graphs with embedded motifs and communities to evaluate link prediction algorithms, providing theoretical bounds and revealing how structural features influence method performance.

Contribution

It presents a novel synthetic graph model with theoretical predictability bounds, facilitating better benchmarking of link prediction methods against structural complexities.

Findings

01

Performance correlates with theoretical predictability

02

No single method outperforms others universally

03

Different methods exploit different network features

Abstract

Predicting missing links in complex networks requires algorithms that are able to explore statistical regularities in the existing data. Here we investigate the interplay between algorithm efficiency and network structures through the introduction of suitably-designed synthetic graphs. We propose a family of random graphs that incorporates both micro-scale motifs and meso-scale communities, two ubiquitous structures in complex networks. A key contribution is the derivation of theoretical upper bounds for link prediction performance in our synthetic graphs, allowing us to estimate the predictability of the task and obtain an improved assessment of the performance of any method. Our results on the performance of classical methods (e.g., Stochastic Block Models, Node2Vec,GraphSage) show that the performance of all methods correlate with the theoretical predictability, that no single method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

avlaskin/synthetic-graphs-for-lp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.