TL;DR
This paper critically examines popular web rankings like Alexa, demonstrating their vulnerability to manipulation and proposing Tranco, a more stable and manipulation-resistant ranking system for research use.
Contribution
The paper introduces Tranco, a new web ranking that is more stable and resistant to manipulation, improving the reliability of web research.
Findings
Existing rankings are easily manipulated, even with minimal effort.
Manipulation can significantly alter the composition of top sites.
Tranco offers a more stable and secure alternative for research.
Abstract
In order to evaluate the prevalence of security and privacy practices on a representative sample of the Web, researchers rely on website popularity rankings such as the Alexa list. While the validity and representativeness of these rankings are rarely questioned, our findings show the contrary: we show for four main rankings how their inherent properties (similarity, stability, representativeness, responsiveness and benignness) affect their composition and therefore potentially skew the conclusions made in studies. Moreover, we find that it is trivial for an adversary to manipulate the composition of these lists. We are the first to empirically validate that the ranks of domains in each of the lists are easily altered, in the case of Alexa through as little as a single HTTP request. This allows adversaries to manipulate rankings on a large scale and insert malicious domains into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
