DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis

Prashanth Vijayaraghavan; Soroush Vosoughi; Lamogha Chiazor; Raya Horesh; Rogerio Abreu de Paula; Ehsan Degan; Vandana Mukherjee

arXiv:2505.14971·cs.CL·June 6, 2025

DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis

Prashanth Vijayaraghavan, Soroush Vosoughi, Lamogha Chiazor, Raya Horesh, Rogerio Abreu de Paula, Ehsan Degan, Vandana Mukherjee

PDF

Open Access

TL;DR

This paper introduces DECASTE, a framework for detecting caste biases in large language models across multiple societal dimensions, revealing systematic reinforcement of harmful stereotypes.

Contribution

It presents a novel multi-dimensional bias analysis framework specifically targeting caste biases in LLMs, filling a critical gap in bias evaluation methods.

Findings

01

LLMs exhibit significant caste biases across socio-cultural, economic, educational, and political dimensions.

02

Bias scores are higher for marginalized castes like Dalits and Shudras compared to dominant castes.

03

The study highlights the need for more inclusive bias mitigation strategies in LLM deployment.

Abstract

Recent advancements in large language models (LLMs) have revolutionized natural language processing (NLP) and expanded their applications across diverse domains. However, despite their impressive capabilities, LLMs have been shown to reflect and perpetuate harmful societal biases, including those based on ethnicity, gender, and religion. A critical and underexplored issue is the reinforcement of caste-based biases, particularly towards India's marginalized caste groups such as Dalits and Shudras. In this paper, we address this gap by proposing DECASTE, a novel, multi-dimensional framework designed to detect and assess both implicit and explicit caste biases in LLMs. Our approach evaluates caste fairness across four dimensions: socio-cultural, economic, educational, and political, using a range of customized prompting strategies. By benchmarking several state-of-the-art LLMs, we reveal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Natural Language Processing Techniques