Cultural Structures of Knowledge from Wikipedia Networks of First Links
Maxime Gabella

TL;DR
This paper uncovers how Wikipedia's network of first links reveals an implicit, culturally-influenced classification hierarchy centered on core concepts, reflecting historical and cultural differences across language editions.
Contribution
It demonstrates that an implicit classification structure naturally emerges from Wikipedia's link network, influenced by cultural history, without manual categorization.
Findings
A core cycle of fundamental concepts exists in Wikipedia networks.
Cultural differences influence the central concepts in different language editions.
The structure reflects historical and cultural backgrounds of language communities.
Abstract
Knowledge is useless without structure. While the classification of knowledge has been an enduring philosophical enterprise, it recently found applications in computer science, notably for artificial intelligence. The availability of large databases allowed for complex ontologies to be built automatically, for example by extracting structured content from Wikipedia. However, this approach is subject to manual categorization decisions made by online editors. Here we show that an implicit classification hierarchy emerges spontaneously on Wikipedia. We study the network of first links between articles, and find that it centers on a core cycle involving concepts of fundamental classifying importance. We argue that this structure is rooted in cultural history. For European languages, articles like Philosophy and Science are central, whereas Human and Earth dominate for East Asian languages.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWikis in Education and Collaboration
