Connecting every bit of knowledge: The structure of Wikipedia's First Link Network
Mark Ibrahim, Christopher M. Danforth, Peter Sheridan Dodds

TL;DR
This paper analyzes Wikipedia's First Link Network, revealing a scale-free structure where most paths lead to Philosophy, highlighting the network's organization and influence of key articles.
Contribution
Introduces a novel traversal funnels method to measure influence in directed networks and applies it to Wikipedia's link structure.
Findings
Most paths lead to Philosophy, indicating centrality.
First link accumulation follows a scale-free distribution.
Key articles like Philosophy, Health Care, and Fossil Fuel dominate influence.
Abstract
Apples, porcupines, and the most obscure Bob Dylan song---is every topic a few clicks from Philosophy? Within Wikipedia, the surprising answer is yes: nearly all paths lead to Philosophy. Wikipedia is the largest, most meticulously indexed collection of human knowledge ever amassed. More than information about a topic, Wikipedia is a web of naturally emerging relationships. By following the first link in each article, we algorithmically construct a directed network of all 4.7 million articles: Wikipedia's First Link Network. Here, we study the English edition of Wikipedia's First Link Network for insight into how the many articles on inventions, places, people, objects, and events are related and organized. By traversing every path, we measure the accumulation of first links, path lengths, groups of path-connected articles, and cycles. We also develop a new method, traversal funnels,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
