Detecting HIV-Related Stigma in Clinical Narratives Using Large Language Models
Ziyi Chen, Yasir Khan, Mengyuan Zhang, Cheng Peng, Mengxian Lyu, Yiyang Liu, Krishna Vaddiparti, Robert L Cook, Mattia Prosperi, Yonghui Wu

TL;DR
This study develops and evaluates large language models to detect HIV-related stigma in clinical narratives, achieving promising results and highlighting challenges in zero-shot inference.
Contribution
It introduces the first NLP tool leveraging LLMs for identifying HIV stigma in clinical notes, with performance analysis across different models and stigma subscales.
Findings
GatorTron-large achieved the highest Micro F1 score of 0.62.
Few-shot prompting improved generative model performance significantly.
Negative Self-Image stigma was most predictable, Personalized Stigma was most challenging.
Abstract
Human immunodeficiency virus (HIV)-related stigma is a critical psychosocial determinant of health for people living with HIV (PLWH), influencing mental health, engagement in care, and treatment outcomes. Although stigma-related experiences are documented in clinical narratives, there is a lack of off-the-shelf tools to extract and categorize them. This study aims to develop a large language model (LLM)-based tool for identifying HIV stigma from clinical notes. We identified clinical notes from PLWH receiving care at the University of Florida (UF) Health between 2012 and 2022. Candidate sentences were identified using expert-curated stigma-related keywords and iteratively expanded via clinical word embeddings. A total of 1,332 sentences were manually annotated across four stigma subscales: Concern with Public Attitudes, Disclosure Concerns, Negative Self-Image, and Personalized Stigma.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
