Responsible AI in NLP: GUS-Net Span-Level Bias Detection Dataset and Benchmark for Generalizations, Unfairness, and Stereotypes

Maximus Powers; Shaina Raza; Alex Chang; Rehana Riaz; Umang Mavani; Harshitha Reddy Jonala; Ansh Tiwari; Hua Wei

arXiv:2410.08388·cs.CL·September 17, 2025

Responsible AI in NLP: GUS-Net Span-Level Bias Detection Dataset and Benchmark for Generalizations, Unfairness, and Stereotypes

Maximus Powers, Shaina Raza, Alex Chang, Rehana Riaz, Umang Mavani, Harshitha Reddy Jonala, Ansh Tiwari, Hua Wei

PDF

Open Access 1 Repo 3 Models 3 Datasets

TL;DR

This paper introduces GUS-Net, a span-level bias detection dataset and benchmark for NLP, enabling more precise identification of social biases like stereotypes and unfairness in text.

Contribution

It presents a novel span-level annotation framework and a multi-label token classifier, improving bias detection granularity and interpretability in NLP models.

Findings

01

Encoder-based models outperform decoder-based models in bias detection tasks.

02

The GUS dataset contains 3,739 snippets with over 69,000 token annotations.

03

The framework facilitates systematic auditing and mitigation of biases in NLP systems.

Abstract

Representational harms in language technologies often occur in short spans within otherwise neutral text, where phrases may simultaneously convey generalizations, unfairness, or stereotypes. Framing bias detection as sentence-level classification obscures which words carry bias and what type is present, limiting both auditability and targeted mitigation. We introduce the GUS-Net Framework, comprising the GUS dataset and a multi-label token-level detector for span-level analysis of social bias. The GUS dataset contains 3,739 unique snippets across multiple domains, with over 69,000 token-level annotations. Each token is labeled using BIO tags (Begin, Inside, Outside) for three pathways of representational harm: Generalizations, Unfairness, and Stereotypes. To ensure reliable data annotation, we employ an automated multi-agent pipeline that proposes candidate spans which are subsequently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ethical-spectacle/fair-ly
pytorchOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection