Are Large Language Models a Threat to Digital Public Goods? Evidence from Activity on Stack Overflow
Maria del Rio-Chanona, Nadzeya Laurentsyeva, Johannes Wachs

TL;DR
This study examines how the adoption of ChatGPT has led to a significant decline in activity on Stack Overflow, especially for popular programming languages, potentially impacting the availability of open data for future AI models.
Contribution
It provides empirical evidence of ChatGPT's impact on online programming communities and quantifies the decline in user-generated content on Stack Overflow due to AI substitution.
Findings
16% decrease in weekly Stack Overflow posts after the release of ChatGPT
Greater decline for popular programming languages
No significant change in voting scores for posts
Abstract
Large language models like ChatGPT efficiently provide users with information about various topics, presenting a potential substitute for searching the web and asking people for help online. But since users interact privately with the model, these models may drastically reduce the amount of publicly available human-generated data and knowledge resources. This substitution can present a significant problem in securing training data for future models. In this work, we investigate how the release of ChatGPT changed human-generated open data on the web by analyzing the activity on Stack Overflow, the leading online Q&A platform for computer programming. We find that relative to its Russian and Chinese counterparts, where access to ChatGPT is limited, and to similar forums for mathematics, where ChatGPT is less capable, activity on Stack Overflow significantly decreased. A…
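The abstract compares Stack Overflow with platforms where ChatGPT is less available or less capable. One common way to express such a comparison is a difference-in-differences estimate on log weekly post counts. The sketch below is only an illustration of that general idea, not the authors' actual specification: the function names `did_log_change` and `to_percent` are hypothetical, and the weekly counts are made-up toy numbers, not data from the paper.

```python
import math


def did_log_change(treated_pre, treated_post, control_pre, control_post):
    """Difference-in-differences on mean log weekly post counts.

    Each argument is a list of positive weekly post counts.
    Returns the change in the treated platform's mean log activity
    minus the change in the control platform's mean log activity.
    """
    def mean_log(counts):
        return sum(math.log(c) for c in counts) / len(counts)

    treated_delta = mean_log(treated_post) - mean_log(treated_pre)
    control_delta = mean_log(control_post) - mean_log(control_pre)
    return treated_delta - control_delta


def to_percent(log_change):
    """Convert a log-point change into a percentage change."""
    return (math.exp(log_change) - 1.0) * 100.0


# Toy numbers (assumptions for illustration only, NOT from the paper):
# the treated platform drops from 1000 to 800 posts per week,
# while the control platform stays flat at 500 posts per week.
treated_pre = [1000, 1000, 1000]
treated_post = [800, 800, 800]
control_pre = [500, 500, 500]
control_post = [500, 500, 500]

effect = did_log_change(treated_pre, treated_post, control_pre, control_post)
print(f"Estimated relative change: {to_percent(effect):.1f}%")
```

With these toy inputs the control platform does not change, so the estimate is simply the treated platform's own 20% drop. If the control platform had also declined, that decline would be subtracted out, which is the point of using a comparison group.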
Taxonomy
Topics: Expert finding and Q&A systems · Online Learning and Analytics · FinTech, Crowdfunding, Digital Finance
