Spurious Correlations and Beyond: Understanding and Mitigating Shortcut Learning in SDOH Extraction with Large Language Models

Fardin Ahsan Sakib; Ziwei Zhu; Karen Trister Grace; Meliha Yetisgen; Ozlem Uzuner

arXiv:2506.00134·cs.CL·June 3, 2025

Spurious Correlations and Beyond: Understanding and Mitigating Shortcut Learning in SDOH Extraction with Large Language Models

Fardin Ahsan Sakib, Ziwei Zhu, Karen Trister Grace, Meliha Yetisgen, Ozlem Uzuner

PDF

Open Access 1 Video

TL;DR

This paper investigates how large language models for social determinants of health extraction can rely on superficial cues, leading to false predictions and disparities, and explores mitigation strategies to improve their reliability in clinical text analysis.

Contribution

It identifies spurious correlations in LLM-based SDOH extraction and evaluates mitigation techniques like prompt engineering and reasoning chains to enhance model robustness.

Findings

01

Mentions of alcohol or smoking cause false drug use predictions.

02

Gender disparities affect model performance.

03

Mitigation strategies reduce false positives.

Abstract

Social determinants of health (SDOH) extraction from clinical text is critical for downstream healthcare analytics. Although large language models (LLMs) have shown promise, they may rely on superficial cues leading to spurious predictions. Using the MIMIC portion of the SHAC (Social History Annotation Corpus) dataset and focusing on drug status extraction as a case study, we demonstrate that mentions of alcohol or smoking can falsely induce models to predict current/past drug use where none is present, while also uncovering concerning gender disparities in model performance. We further evaluate mitigation strategies - such as prompt engineering and chain-of-thought reasoning - to reduce these false positives, providing insights into enhancing LLM reliability in health domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Spurious Correlations and Beyond: Understanding and Mitigating Shortcut Learning in SDOH Extraction with Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques