Stereotype Detection as a Catalyst for Enhanced Bias Detection: A Multi-Task Learning Approach

Aditya Tomar; Rudra Murthy; Pushpak Bhattacharyya

arXiv:2507.01715·cs.CL·July 3, 2025

Stereotype Detection as a Catalyst for Enhanced Bias Detection: A Multi-Task Learning Approach

Aditya Tomar, Rudra Murthy, Pushpak Bhattacharyya

PDF

Open Access

TL;DR

This paper proposes a multi-task learning approach for bias and stereotype detection in language models, demonstrating that joint training improves bias detection performance and leveraging stereotype information enhances fairness.

Contribution

It introduces StereoBias, a new dataset for bias and stereotype detection, and shows that joint learning of these tasks improves bias detection accuracy across different model architectures.

Findings

01

Joint training significantly improves bias detection performance.

02

Encoder-only models perform well, but decoder-only models are competitive.

03

Leveraging stereotype information enhances fairness in AI systems.

Abstract

Bias and stereotypes in language models can cause harm, especially in sensitive areas like content moderation and decision-making. This paper addresses bias and stereotype detection by exploring how jointly learning these tasks enhances model performance. We introduce StereoBias, a unique dataset labeled for bias and stereotype detection across five categories: religion, gender, socio-economic status, race, profession, and others, enabling a deeper study of their relationship. Our experiments compare encoder-only models and fine-tuned decoder-only models using QLoRA. While encoder-only models perform well, decoder-only models also show competitive results. Crucially, joint training on bias and stereotype detection significantly improves bias detection compared to training them separately. Additional experiments with sentiment analysis confirm that the improvements stem from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Spam and Phishing Detection