TL;DR
This study analyzes the growth and inequality patterns in Wikimedia projects, revealing early formation of supereditor oligopolies and suggesting that such disparities are inherent and unlikely to resolve spontaneously.
Contribution
It introduces a comprehensive analysis of inequality dynamics across all Wikimedia projects and proposes a model explaining the early emergence of supereditor oligopolies.
Findings
Inequality, measured by Gini coefficient, rapidly increases in Wikimedia data.
Supereditor groups form early and remain dominant over time.
Inequality is an inherent feature of open-editing communal datasets.
Abstract
We perform an in-depth analysis on the inequality in 863 Wikimedia projects. We take the complete editing history of 267,304,095 Wikimedia items until 2016, which not only covers every language edition of Wikipedia, but also embraces the complete versions of Wiktionary, Wikisource, Wikivoyage, etc. Our findings of common growth pattern described by the interrelations between four characteristic growth yardsticks suggest a universal law of communal data formation. In this encyclopaedic data set, we observe the interplay between the number of edits and the degree of inequality. In particular, the rapid increase in the Gini coefficient suggests that this entrenched inequality stems from the nature of such open-editing communal data sets, namely the abiogenesis of the supereditors' oligopoly. We show that these supereditor groups were created at the early stages of these open-editing media…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
