TL;DR
This paper introduces LinkWaldo, a framework that efficiently identifies promising node pairs for link prediction by combining structural resemblance and proximity, addressing the challenge of vast search spaces in sparse graphs.
Contribution
It proposes a novel method that integrates stochastic block models with proximity measures to select candidate pairs, improving link prediction candidate quality.
Findings
LinkWaldo finds 7-33% more missing and future links than baseline methods.
The framework is adaptable to various node representations and heuristics.
Evaluation on 13 diverse networks demonstrates its effectiveness.
Abstract
The traditional setup of link prediction in networks assumes that a test set of node pairs, which is usually balanced, is available over which to predict the presence of links. However, in practice, there is no test set: the ground-truth is not known, so the number of possible pairs to predict over is quadratic in the number of nodes in the graph. Moreover, because graphs are sparse, most of these possible pairs will not be links. Thus, link prediction methods, which often rely on proximity-preserving embeddings or heuristic notions of node similarity, face a vast search space, with many pairs that are in close proximity, but that should not be linked. To mitigate this issue, we introduce LinkWaldo, a framework for choosing from this quadratic, massively-skewed search space of node pairs, a concise set of candidate pairs that, in addition to being in close proximity, also structurally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
