Improving Generalizability of Extracting Social Determinants of Health   Using Large Language Models through Prompt-tuning

Cheng Peng; Zehao Yu; Kaleb E Smith; Wei-Hsuan Lo-Ciganic; Jiang Bian,; Yonghui Wu

arXiv:2403.12374·cs.CL·March 20, 2024·1 cites

Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning

Cheng Peng, Zehao Yu, Kaleb E Smith, Wei-Hsuan Lo-Ciganic, Jiang Bian,, Yonghui Wu

PDF

Open Access

TL;DR

This paper introduces a prompt-tuning approach for large language models to improve the extraction of social determinants of health across different domains, outperforming traditional fine-tuning methods.

Contribution

The study presents a novel soft prompt-based learning architecture for LLMs, demonstrating enhanced cross-domain generalizability in SDoH extraction compared to fine-tuning.

Findings

01

Decoder-only LLMs with prompt tuning outperform fine-tuned models.

02

GatorTronGPT achieved up to 21.8% higher F1 scores.

03

Prompt tuning improves cross-institution and cross-disease applications.

Abstract

The progress in natural language processing (NLP) using large language models (LLMs) has greatly improved patient information extraction from clinical narratives. However, most methods based on the fine-tuning strategy have limited transfer learning ability for cross-domain applications. This study proposed a novel approach that employs a soft prompt-based learning architecture, which introduces trainable prompts to guide LLMs toward desired outputs. We examined two types of LLM architectures, including encoder-only GatorTron and decoder-only GatorTronGPT, and evaluated their performance for the extraction of social determinants of health (SDoH) using a cross-institution dataset from the 2022 n2c2 challenge and a cross-disease dataset from the University of Florida (UF) Health. The results show that decoder-only LLMs with prompt tuning achieved better performance in cross-domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods