Hierarchical Hyperlink Prediction for the WWW
Dario Garcia-Gasulla, Eduard Ayguad\'e, Jes\'us Labarta, Ulises, Cort\'es, Toyotaro Suzumura

TL;DR
This paper introduces a hierarchical similarity-based method for hyperlink prediction on large webgraphs, improving accuracy while maintaining scalability, and discusses high-performance computing implementation details.
Contribution
The paper proposes a novel hierarchical approach for hyperlink prediction that outperforms existing similarity-based algorithms on large webgraphs.
Findings
Hierarchical method significantly improves prediction accuracy.
Approach is scalable and suitable for large webgraphs.
Implementation details enable high-performance computation.
Abstract
The hyperlink prediction task, that of proposing new links between webpages, can be used to improve search engines, expand the visibility of web pages, and increase the connectivity and navigability of the web. Hyperlink prediction is typically performed on webgraphs composed by thousands or millions of vertices, where on average each webpage contains less than fifty links. Algorithms processing graphs so large and sparse require to be both scalable and precise, a challenging combination. Similarity-based algorithms are among the most scalable solutions within the link prediction field, due to their parallel nature and computational simplicity. These algorithms independently explore the nearby topological features of every missing link from the graph in order to determine its likelihood. Unfortunately, the precision of similarity-based algorithms is limited, which has prevented their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Web Data Mining and Analysis · Advanced Graph Neural Networks
