Content Moderation and the Formation of Online Communities: A Theoretical Framework
Cynthia Dwork, Chris Hays, Jon Kleinberg, Manish Raghavan

TL;DR
This paper develops a theoretical framework to analyze how content moderation policies influence online community participation, content diversity, and platform competition, revealing nuanced and sometimes counter-intuitive effects.
Contribution
It introduces a mathematically tractable model that captures the complex effects of moderation policies on community stability, participation, and content diversity, considering resource limitations and platform competition.
Findings
Moderation policies can increase participation and diversify content.
Content moderation effectiveness depends on platform resources and ideological goals.
Platforms' moderation strategies influence community stability and competition.
Abstract
We study the impact of content moderation policies in online communities. In our theoretical model, a platform chooses a content moderation policy and individuals choose whether or not to participate in the community according to the fraction of user content that aligns with their preferences. The effects of content moderation, at first blush, might seem obvious: it restricts speech on a platform. However, when user participation decisions are taken into account, its effects can be more subtle and counter-intuitive. For example, our model can straightforwardly demonstrate how moderation policies may increase participation and diversify content available on the platform. In our analysis, we explore a rich set of interconnected phenomena related to content moderation in online communities. We first characterize the effectiveness of a natural class of moderation policies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Social Media and Politics · Opinion Dynamics and Social Influence
