Unfair Mistakes on Social Media: How Demographic Characteristics influence Authorship Attribution

Jasmin Wyss; Rebekah Overdorf

arXiv:2510.19708·cs.SI·October 23, 2025

Unfair Mistakes on Social Media: How Demographic Characteristics influence Authorship Attribution

Jasmin Wyss, Rebekah Overdorf

PDF

Open Access

TL;DR

This study systematically audits authorship attribution models on social media for demographic bias, revealing that while models seem fair in closed settings, errors tend to favor users sharing demographic traits, highlighting fairness issues in real-world scenarios.

Contribution

The paper provides a comprehensive bias audit of authorship attribution models across multiple demographic groups, revealing nuanced fairness issues especially when true authors are excluded from candidate sets.

Findings

01

Authorship attribution models show no bias in closed-world settings.

02

Errors tend to favor users sharing demographic traits with the true author.

03

Fairness in closed settings does not guarantee fairness in open-world error scenarios.

Abstract

Authorship attribution techniques are increasingly being used in online contexts such as sock puppet detection, malicious account linking, and cross-platform account linking. Yet, it is unknown whether these models perform equitably across different demographic groups. Bias in such techniques could lead to false accusations, account banning, and privacy violations disproportionately impacting users from certain demographics. In this paper, we systematically audit authorship attribution for bias with respect to gender, native language, and age. We evaluate fairness in 3 ways. First, we evaluate how the proportion of users with a certain demographic characteristic impacts the overall classifier performance. Second, we evaluate if a user's demographic characteristics influence the probability that their texts are misclassified. Our analysis indicates that authorship attribution does not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Hate Speech and Cyberbullying Detection · Spam and Phishing Detection