A large-scale study of the World Wide Web: network correlation functions with scale-invariant boundaries
G.A. Luduena, H. Meixner, Gregor Kaczor, Claudius Gros

TL;DR
This large-scale study analyzes the structural correlations within the World Wide Web, revealing power-law behaviors and hierarchical organization through extensive network metrics and traffic data.
Contribution
It introduces a comprehensive analysis of web network correlations, including directed clustering coefficients and traffic rank relationships, highlighting scale-invariant boundaries and hierarchical structures.
Findings
Strong correlation effects in network properties.
Power-law dependencies in correlation boundaries.
Hierarchical structure indicated by link count thresholds.
Abstract
We performed a large-scale crawl of the World Wide Web, covering 6.9 Million domains and 57 Million subdomains, including all high-traffic sites of the Internet. We present a study of the correlations found between quantities measuring the structural relevance of each node in the network (the in- and out-degree, the local clustering coefficient, the first-neighbor in-degree and the Alexa rank). We find that some of these properties show strong correlation effects and that the dependencies occurring out of these correlations follow power laws not only for the averages, but also for the boundaries of the respective density distributions. In addition, these scale-free limits do not follow the same exponents as the corresponding averages. In our study we retain the directionality of the hyperlinks and develop a statistical estimate for the clustering coefficient of directed graphs. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
