A Test of Time: Predicting the Sustainable Success of Online Collaboration in Wikipedia
Abraham Israeli, David Jurgens, Daniel Romero

TL;DR
This paper introduces a new metric for assessing the long-term sustainability of quality in online collaborative projects, using Wikipedia data and machine learning to predict which articles maintain high standards over time.
Contribution
It proposes the 'Sustainable Success' metric, creates the SustainPedia dataset, and develops predictive models that identify key factors influencing long-term quality preservation in Wikipedia articles.
Findings
Longer recognition time correlates with sustained quality.
User experience is the most important predictor of sustainability.
Predictive models achieve an AU-ROC of 0.88 on average.
Abstract
The Internet has significantly expanded the potential for global collaboration, allowing millions of users to contribute to collective projects like Wikipedia. While prior work has assessed the success of online collaborations, most approaches are time-agnostic, evaluating success without considering its longevity. Research on the factors that ensure the long-term preservation of high-quality standards in online collaboration is scarce. In this study, we address this gap. We propose a novel metric, `Sustainable Success,' which measures the ability of collaborative efforts to maintain their quality over time. Using Wikipedia as a case study, we introduce the SustainPedia dataset, which compiles data from over 40K Wikipedia articles, including each article's sustainable success label and more than 300 explanatory features such as edit history, user experience, and team composition. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWikis in Education and Collaboration · Web and Library Services · Open Source Software Innovations
