Demographic Biases and Gaps in the Perception of Sexism in Large Language Models
Judith Tavarez-Rodr\'iguez, Fernando S\'anchez-Vega, A. Pastor L\'opez-Monroy

TL;DR
This paper investigates how large language models detect sexism in social media, revealing they lack the ability to accurately reflect diverse demographic perceptions, highlighting biases and the need for more inclusive models.
Contribution
It evaluates LLMs' ability to detect sexism across demographic groups and analyzes biases, emphasizing the importance of accounting for diverse perspectives in automated detection.
Findings
LLMs can detect sexism to some extent but lack demographic diversity.
Models do not accurately reflect perceptions of minority groups.
Demographic biases significantly influence sexism detection performance.
Abstract
The use of Large Language Models (LLMs) has proven to be a tool that could help in the automatic detection of sexism. Previous studies have shown that these models contain biases that do not accurately reflect reality, especially for minority groups. Despite various efforts to improve the detection of sexist content, this task remains a significant challenge due to its subjective nature and the biases present in automated models. We explore the capabilities of different LLMs to detect sexism in social media text using the EXIST 2024 tweet dataset. It includes annotations from six distinct profiles for each tweet, allowing us to evaluate to what extent LLMs can mimic these groups' perceptions in sexism detection. Additionally, we analyze the demographic biases present in the models and conduct a statistical analysis to identify which demographic characteristics (age, gender) contribute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
