High Risk of Political Bias in Black Box Emotion Inference Models

Hubert Plisiecki; Paweł Lenartowicz; Maria Flakus; Artur Pokropek

arXiv:2407.13891 · cs.CL · November 22, 2024 · 1 citation

TL;DR

This study reveals significant political bias in emotion inference models used for sentiment analysis, demonstrating how biases in training data can skew social science research outcomes and proposing mitigation strategies.

Contribution

The paper provides a bias audit of a Polish sentiment analysis model, highlighting political bias propagation and testing dataset pruning as a mitigation method.

Findings

01 Bias in model predictions linked to political affiliations

02 Human annotations propagate political biases

03 Pruning training data reduces but does not eliminate bias

Abstract

This paper investigates the presence of political bias in emotion inference models used for sentiment analysis (SA) in social science research. Machine learning models often reflect biases in their training data, impacting the validity of their outcomes. While previous research has highlighted gender and race biases, our study focuses on political bias - an underexplored yet pervasive issue that can skew the interpretation of text data across a wide array of studies. We conducted a bias audit on a Polish sentiment analysis model developed in our lab. By analyzing valence predictions for names and sentences involving Polish politicians, we uncovered systematic differences influenced by political affiliations. Our findings indicate that annotations by human raters propagate political biases into the model's predictions. To mitigate this, we pruned the training dataset of texts mentioning…
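The two steps the abstract describes, auditing valence predictions across politician names and pruning the training data, can be illustrated with a minimal sketch. It is not the authors' code. The abstract is cut off before it states exactly what was pruned, so the sketch assumes a caller-supplied list of terms. The names `prune_texts`, `mean_valence_by_group`, and `toy_predict`, the sentence templates, and the placeholder names "Alpha" and "Beta" are all hypothetical, and `toy_predict` is a deliberately biased stand-in for a real valence model.

```python
import re
from statistics import mean
from typing import Callable, Dict, Iterable, List


def prune_texts(texts: Iterable[str], terms: Iterable[str]) -> List[str]:
    """Drop every text that mentions any term (case-insensitive, whole word).

    Assumption: pruning is a plain keyword filter over a caller-supplied list.
    """
    terms = list(terms)
    if not terms:
        return list(texts)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    return [t for t in texts if not pattern.search(t)]


def mean_valence_by_group(
    predict: Callable[[str], float],
    templates: List[str],
    names_by_group: Dict[str, List[str]],
) -> Dict[str, float]:
    """Average predicted valence per group over all name/template pairs."""
    result = {}
    for group, names in names_by_group.items():
        scores = [
            predict(template.format(name=name))
            for name in names
            for template in templates
        ]
        result[group] = mean(scores)
    return result


def toy_predict(text: str) -> float:
    # Hypothetical stand-in for a valence model, biased on purpose
    # so that the audit has a gap to detect.
    return 0.8 if "Alpha" in text else 0.2


if __name__ == "__main__":
    corpus = ["Alpha spoke today", "Nice weather", "alpha again"]
    print(prune_texts(corpus, ["Alpha"]))

    templates = ["{name}", "{name} gave a speech."]
    groups = {"party_a": ["Alpha"], "party_b": ["Beta"]}
    print(mean_valence_by_group(toy_predict, templates, groups))
```

A gap between the per-group means for otherwise identical sentences is the kind of systematic difference the audit looks for. Exact keyword matching is a simplification: in an inflected language such as Polish, declined forms of a name would not match unless each form is listed or the text is lemmatized first.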

Taxonomy

Topics: Computational and Text Analysis Methods