Artificial Intolerance: Stigmatizing Language in Clinical Documentation Skews Large Language Model Decision-Making

Jen-tse Huang; Didi Zhou; Faith Kamau; Amy Oh; Anne R. Links; Mark Dredze; Mary Catherine Beach; Somnath Saha

arXiv:2605.17228 · cs.CL · May 19, 2026

TL;DR

This study shows that large language models used in clinical settings are biased by stigmatizing language, which can significantly skew their medical decision-making, and that this bias resists standard mitigation strategies.

Contribution

The paper systematically evaluates nine frontier LLMs across four stigmatized medical conditions and shows that stigmatizing language in clinical notes skews their decisions, exposing a critical fairness issue.

Findings

01

All evaluated models skew clinical decisions toward less aggressive patient management when notes contain stigmatizing language.

02

A single stigmatizing sentence can alter model outputs.

03

Standard mitigation strategies have limited effectiveness.

Abstract

Large Language Models (LLMs) are increasingly deployed in high-stakes domains such as clinical decision support and medical documentation. However, the robustness of these models against subtle linguistic variations, specifically stigmatizing language (SL) commonly found in human-authored clinical notes, remains critically under-explored. In this work, we investigate whether frontier LLMs inherit and propagate this human bias when processing clinical text. We systematically evaluate nine frontier LLMs across four stigmatized medical conditions, utilizing clinical vignettes injected with varying intensities and phenotypes of SL (doubt, blame, and maligning). Our results demonstrate that all evaluated models exhibit substantial bias, with clinical decision-making significantly skewed towards less aggressive patient management. Notably, we observe a high sensitivity to linguistic framing,…
