Effects of algorithmic flagging on fairness: quasi-experimental evidence from Wikipedia
Nathan TeBlunthuis, Benjamin Mako Hill, Aaron Halfaker

TL;DR
This study investigates how algorithmic flagging influences fairness in Wikipedia moderation, showing that it increases reversion of flagged edits and reduces undoing moderation actions, but the effects depend on context.
Contribution
It provides causal evidence that algorithmic flagging can improve fairness in online moderation by reducing bias from social signals.
Findings
Flagged edits are reverted more often.
Flagging reduces the likelihood of moderation actions being undone.
Effects of flagging on fairness are context-dependent.
Abstract
Online community moderators often rely on social signals such as whether or not a user has an account or a profile page as clues that users may cause problems. Reliance on these clues can lead to "overprofiling'' bias when moderators focus on these signals but overlook the misbehavior of others. We propose that algorithmic flagging systems deployed to improve the efficiency of moderation work can also make moderation actions more fair to these users by reducing reliance on social signals and making norm violations by everyone else more visible. We analyze moderator behavior in Wikipedia as mediated by RCFilters, a system which displays social signals and algorithmic flags, and estimate the causal effect of being flagged on moderator actions. We show that algorithmically flagged edits are reverted more often, especially those by established editors with positive social signals, and that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExperimental Behavioral Economics Studies · FinTech, Crowdfunding, Digital Finance · Media Influence and Politics
