Understanding Editing Behaviors in Multilingual Wikipedia
Suin Kim, Sungjoon Park, Scott A. Hale, Sooyoung Kim, Jeongmin Byun, and Alice Oh

TL;DR
This study analyzes multilingual Wikipedia editors to understand their editing behaviors, language proficiency, and content complexity, revealing English's unique role and the challenges of language barriers in content spread.
Contribution
It provides a large-scale computational analysis of multilingual editing patterns across Wikipedia, highlighting differences in content complexity and engagement based on language proficiency.
Findings
English editors engage with complex content similarly regardless of primary language.
Content complexity is lower in second-language edits, indicating a complexity barrier.
Multilingual editors show lower engagement and proficiency in their second languages.
Abstract
Multilingualism is common offline, but we have a more limited understanding of the ways multilingualism is displayed online and the roles that multilinguals play in the spread of content between speakers of different languages. We take a computational approach to studying multilingualism using one of the largest user-generated content platforms, Wikipedia. We study multilingualism by collecting and analyzing a large dataset of the content written by multilingual editors of the English, German, and Spanish editions of Wikipedia. This dataset contains over two million paragraphs edited by over 15,000 multilingual users from July 8 to August 9, 2013. We analyze these multilingual editors in terms of their engagement, interests, and language proficiency in their primary and non-primary (secondary) languages and find that the English edition of Wikipedia displays different dynamics from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
