High Risk of Political Bias in Black Box Emotion Inference Models
Hubert Plisiecki, Pawe{\l} Lenartowicz, Maria Flakus, Artur Pokropek

TL;DR
This study reveals significant political bias in emotion inference models used for sentiment analysis, demonstrating how biases in training data can skew social science research outcomes and proposing mitigation strategies.
Contribution
The paper provides a bias audit of a Polish sentiment analysis model, highlighting political bias propagation and testing dataset pruning as a mitigation method.
Findings
Bias in model predictions linked to political affiliations
Human annotations propagate political biases
Pruning training data reduces but does not eliminate bias
Abstract
This paper investigates the presence of political bias in emotion inference models used for sentiment analysis (SA) in social science research. Machine learning models often reflect biases in their training data, impacting the validity of their outcomes. While previous research has highlighted gender and race biases, our study focuses on political bias - an underexplored yet pervasive issue that can skew the interpretation of text data across a wide array of studies. We conducted a bias audit on a Polish sentiment analysis model developed in our lab. By analyzing valence predictions for names and sentences involving Polish politicians, we uncovered systematic differences influenced by political affiliations. Our findings indicate that annotations by human raters propagate political biases into the model's predictions. To mitigate this, we pruned the training dataset of texts mentioning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
