On the Bias of Traceroute Sampling; or, Power-law Degree Distributions in Regular Graphs
Dimitris Achlioptas, Aaron Clauset, David Kempe, and Cristopher Moore

TL;DR
This paper rigorously analyzes how traceroute sampling introduces bias in observed network degree distributions, often creating the illusion of power-law behavior even in networks with different underlying distributions.
Contribution
It provides a systematic, mathematical framework to quantify traceroute sampling bias across various degree distributions, extending previous empirical findings.
Findings
Traceroute sampling can produce power-law degree distributions from non-power-law networks.
The expected degree distribution of traceroute trees can be explicitly calculated for general degree distributions.
The analysis confirms and generalizes prior empirical observations of sampling bias in network topology studies.
Abstract
Understanding the structure of the Internet graph is a crucial step for building accurate network models and designing efficient algorithms for Internet applications. Yet, obtaining its graph structure is a surprisingly difficult task, as edges cannot be explicitly queried. Instead, empirical studies rely on traceroutes to build what are essentially single-source, all-destinations, shortest-path trees. These trees only sample a fraction of the network's edges, and a recent paper by Lakhina et al. found empirically that the resuting sample is intrinsically biased. For instance, the observed degree distribution under traceroute sampling exhibits a power law even when the underlying degree distribution is Poisson. In this paper, we study the bias of traceroute sampling systematically, and, for a very general class of underlying degree distributions, calculate the likely observed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Limits and Structures in Graph Theory · Graph theory and applications
