Effects of algorithmic flagging on fairness: quasi-experimental evidence from Wikipedia

Nathan TeBlunthuis; Benjamin Mako Hill; Aaron Halfaker

arXiv:2006.03121·cs.CY·January 6, 2026

Effects of algorithmic flagging on fairness: quasi-experimental evidence from Wikipedia

Nathan TeBlunthuis, Benjamin Mako Hill, Aaron Halfaker

PDF

Open Access

TL;DR

This study investigates how algorithmic flagging influences fairness in Wikipedia moderation, showing that it increases reversion of flagged edits and reduces undoing moderation actions, but the effects depend on context.

Contribution

It provides causal evidence that algorithmic flagging can improve fairness in online moderation by reducing bias from social signals.

Findings

01

Flagged edits are reverted more often.

02

Flagging reduces the likelihood of moderation actions being undone.

03

Effects of flagging on fairness are context-dependent.

Abstract

Online community moderators often rely on social signals such as whether or not a user has an account or a profile page as clues that users may cause problems. Reliance on these clues can lead to "overprofiling'' bias when moderators focus on these signals but overlook the misbehavior of others. We propose that algorithmic flagging systems deployed to improve the efficiency of moderation work can also make moderation actions more fair to these users by reducing reliance on social signals and making norm violations by everyone else more visible. We analyze moderator behavior in Wikipedia as mediated by RCFilters, a system which displays social signals and algorithmic flags, and estimate the causal effect of being flagged on moderator actions. We show that algorithmically flagged edits are reverted more often, especially those by established editors with positive social signals, and that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExperimental Behavioral Economics Studies · FinTech, Crowdfunding, Digital Finance · Media Influence and Politics