Similarity-based Link Prediction from Modular Compression of Network Flows
Christopher Bl\"ocker, Jelena Smiljani\'c, Ingo Scholtes, Martin, Rosvall

TL;DR
This paper introduces MapSim, a novel information-theoretic measure for link prediction that uses modular compression of network flows, offering better interpretability and asymmetric similarity assessment in directed graphs.
Contribution
MapSim provides a new, unsupervised, compression-based approach for node similarity and link prediction that outperforms existing embedding methods in diverse networks.
Findings
MapSim achieves over 7% higher average performance than competitors.
It outperforms all embedding methods in 11 of 47 networks.
MapSim demonstrates the effectiveness of compression-based graph representations.
Abstract
Node similarity scores are a foundation for machine learning in graphs for clustering, node classification, anomaly detection, and link prediction with applications in biological systems, information networks, and recommender systems. Recent works on link prediction use vector space embeddings to calculate node similarities in undirected networks with good performance. Still, they have several disadvantages: limited interpretability, need for hyperparameter tuning, manual model fitting through dimensionality reduction, and poor performance from symmetric similarities in directed link prediction. We propose MapSim, an information-theoretic measure to assess node similarities based on modular compression of network flows. Unlike vector space embeddings, MapSim represents nodes in a discrete, non-metric space of communities and yields asymmetric similarities in an unsupervised fashion. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Graph Neural Networks · Bioinformatics and Genomic Networks
