Evaluating Large Language Models through Gender and Racial Stereotypes

Ananya Malik

arXiv:2311.14788·cs.CL·November 28, 2023·1 cites

Evaluating Large Language Models through Gender and Racial Stereotypes

Ananya Malik

PDF

Open Access

TL;DR

This paper develops a framework to evaluate large language models for gender and racial biases in professional contexts, revealing progress in gender bias reduction but persistent racial bias.

Contribution

It introduces a novel evaluation framework specifically targeting gender and racial biases in language models within professional scenarios.

Findings

01

Gender bias has decreased in newer models.

02

Racial bias remains prevalent.

03

Framework can identify bias levels effectively.

Abstract

Language Models have ushered a new age of AI gaining traction within the NLP community as well as amongst the general population. AI's ability to make predictions, generations and its applications in sensitive decision-making scenarios, makes it even more important to study these models for possible biases that may exist and that can be exaggerated. We conduct a quality comparative study and establish a framework to evaluate language models under the premise of two kinds of biases: gender and race, in a professional setting. We find out that while gender bias has reduced immensely in newer models, as compared to older ones, racial bias still exists.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Computational and Text Analysis Methods