"The Dentist is an involved parent, the bartender is not": Revealing Implicit Biases in QA with Implicit BBQ

Aarushi Wagh; Saniya Srivastava

arXiv:2512.06732·cs.CL·December 9, 2025

Open Access

TL;DR

Benchmarks that rely on explicit cues often miss implicit biases in large language models. ImplicitBBQ is a new benchmark designed to reveal these hidden biases across 6 categories.

Contribution

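The paper introduces ImplicitBBQ, a benchmark that extends the Bias Benchmark for QA (BBQ) with implicitly cued protected attributes, so that fairness evaluation of LLMs covers cues such as names, cultural cues, or traits rather than only attributes declared by name.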

Findings

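01

GPT-4o's accuracy on ImplicitBBQ drops by up to 7% relative to explicit BBQ prompts, with the largest decline in the "sexual orientation" subcategory and a consistent decline across most other categories.

02

The results indicate that current LLMs contain implicit biases that explicit benchmarks do not detect.

03

ImplicitBBQ enables more nuanced fairness evaluations in NLP.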

Abstract

Existing benchmarks evaluating biases in large language models (LLMs) primarily rely on explicit cues, declaring protected attributes like religion, race, or gender by name. However, real-world interactions often contain implicit biases, inferred subtly through names, cultural cues, or traits. This critical oversight creates a significant blind spot in fairness evaluation. We introduce ImplicitBBQ, a benchmark extending the Bias Benchmark for QA (BBQ) with implicitly cued protected attributes across 6 categories. Our evaluation of GPT-4o on ImplicitBBQ reveals a troubling performance disparity relative to explicit BBQ prompts, with accuracy declining by up to 7% in the "sexual orientation" subcategory and a consistent decline observed across most other categories. This indicates that current LLMs contain implicit biases undetected by explicit benchmarks. ImplicitBBQ offers a crucial tool for nuanced…
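The comparison the abstract describes, accuracy on explicit prompts versus accuracy on implicitly cued prompts per category, can be sketched roughly as follows. This is a minimal illustration, not the authors' evaluation code: the record layout, the `accuracy_by_category` and `accuracy_gap` helpers, and the toy rows are all hypothetical and do not come from the paper.

```python
from collections import defaultdict

# Hypothetical toy records: (category, cue_type, model_answer, gold_answer).
# Illustrative placeholders only; these are NOT results from the paper.
records = [
    ("religion", "explicit", "unknown", "unknown"),
    ("religion", "implicit", "unknown", "unknown"),
    ("gender", "explicit", "unknown", "unknown"),
    ("gender", "explicit", "A", "A"),
    ("gender", "implicit", "B", "unknown"),
    ("gender", "implicit", "A", "A"),
]


def accuracy_by_category(rows):
    """Return accuracy keyed by (category, cue_type)."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for category, cue, pred, gold in rows:
        total[(category, cue)] += 1
        correct[(category, cue)] += int(pred == gold)
    return {key: correct[key] / total[key] for key in total}


def accuracy_gap(rows):
    """Explicit-minus-implicit accuracy per category (positive = drop)."""
    acc = accuracy_by_category(rows)
    categories = sorted({category for category, _ in acc})
    return {
        category: acc[(category, "explicit")] - acc[(category, "implicit")]
        for category in categories
        # Only compare categories that have both prompt variants.
        if (category, "explicit") in acc and (category, "implicit") in acc
    }


gaps = accuracy_gap(records)
for category, gap in gaps.items():
    print(f"{category}: accuracy drop of {gap:.1%} on implicit prompts")
```

A positive gap means the model answered fewer implicitly cued questions correctly than explicit ones in that category, which is the kind of disparity the abstract reports.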

Taxonomy

Topics: Topic Modeling · Authorship Attribution and Profiling · Ethics and Social Impacts of AI