Model Tuning or Prompt Tuning? A Study of Large Language Models for   Clinical Concept and Relation Extraction

Cheng Peng; Xi Yang; Kaleb E Smith; Zehao Yu; Aokun Chen; Jiang Bian,; Yonghui Wu

arXiv:2310.06239·cs.CL·April 16, 2024·1 cites

Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation Extraction

Cheng Peng, Xi Yang, Kaleb E Smith, Zehao Yu, Aokun Chen, Jiang Bian,, Yonghui Wu

PDF

Open Access

TL;DR

This study compares prompt tuning and traditional fine-tuning of large language models for clinical concept and relation extraction, showing that frozen large models excel in transfer and few-shot learning, especially when scaled up.

Contribution

It introduces a soft prompt-based learning approach for LLMs in clinical NLP and systematically compares training strategies, highlighting the effectiveness of frozen large models for transfer and few-shot learning.

Findings

01

Soft prompting with unfrozen LLMs achieves top performance.

02

Frozen large LLMs perform well in transfer and few-shot learning.

03

Scaling LLMs up improves frozen model competitiveness.

Abstract

Objective To develop soft prompt-based learning algorithms for large language models (LLMs), examine the shape of prompts, prompt-tuning using frozen/unfrozen LLMs, transfer learning, and few-shot learning abilities. Methods We developed a soft prompt-based LLM model and compared 4 training strategies including (1) fine-tuning without prompts; (2) hard-prompt with unfrozen LLMs; (3) soft-prompt with unfrozen LLMs; and (4) soft-prompt with frozen LLMs. We evaluated 7 pretrained LLMs using the 4 training strategies for clinical concept and relation extraction on two benchmark datasets. We evaluated the transfer learning ability of the prompt-based learning algorithms in a cross-institution setting. We also assessed the few-shot learning ability. Results and Conclusion When LLMs are unfrozen, GatorTron-3.9B with soft prompting achieves the best strict F1-scores of 0.9118 and 0.8604 for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Natural Language Processing Techniques