A statistical approach to the traceroute-like exploration of networks: theory and simulations
Luca Dall'Asta, Ignacio Alvarez-Hamelin, Alain Barrat, Alexei Vazquez,, Alessandro Vespignani

TL;DR
This paper analyzes the biases in network mapping via traceroute-like probes, providing a statistical framework that relates sampling accuracy to network topology and exploring strategies to optimize network discovery.
Contribution
It introduces a mean-field analytical model for sampling biases, linking detection probabilities to betweenness centrality, and evaluates sampling strategies through simulations.
Findings
Detection probability depends on betweenness centrality.
Shortest path sampling better characterizes scale-free networks.
Optimal exploration parameters improve node and edge discovery.
Abstract
Mapping the Internet generally consists in sampling the network from a limited set of sources by using "traceroute"-like probes. This methodology, akin to the merging of different spanning trees to a set of destinations, has been argued to introduce uncontrolled sampling biases that might produce statistical properties of the sampled graph which sharply differ from the original ones. Here we explore these biases and provide a statistical analysis of their origin. We derive a mean-field analytical approximation for the probability of edge and vertex detection that exploits the role of the number of sources and targets and allows us to relate the global topological properties of the underlying network with the statistical accuracy of the sampled graph. In particular we find that the edge and vertex detection probability is depending on the betweenness centrality of each element. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Topological and Geometric Data Analysis · Opinion Dynamics and Social Influence
