A Large-Scale Characterization of How Readers Browse Wikipedia
Tiziano Piccardi, Martin Gerlach, Akhil Arora, Robert West

TL;DR
This study provides a comprehensive large-scale analysis of how Wikipedia readers navigate, revealing diverse patterns influenced by topic, device, and time, and highlighting differences from lab-based navigation.
Contribution
It offers the first systematic analysis of Wikipedia browsing behavior using server logs, uncovering patterns and factors influencing navigation paths.
Findings
Navigation paths are mostly shallow but highly diverse.
Path depth varies systematically with topic, device, and time.
Navigation often links to external pages and is abandoned on low-quality content.
Abstract
Despite the importance and pervasiveness of Wikipedia as one of the largest platforms for open knowledge, surprisingly little is known about how people navigate its content when seeking information. To bridge this gap, we present the first systematic large-scale analysis of how readers browse Wikipedia. Using billions of page requests from Wikipedia's server logs, we measure how readers reach articles, how they transition between articles, and how these patterns combine into more complex navigation paths. We find that navigation behavior is characterized by highly diverse structures. Although most navigation paths are shallow, comprising a single pageload, there is much variety, and the depth and shape of paths vary systematically with topic, device type, and time of day. We show that Wikipedia navigation paths commonly mesh with external pages as part of a larger online ecosystem, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWikis in Education and Collaboration
