A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

Cheng Peng; Mengxian Lyu; Ziyi Chen; Yonghui Wu

arXiv:2604.06650·cs.CL·April 9, 2026

A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

Cheng Peng, Mengxian Lyu, Ziyi Chen, Yonghui Wu

PDF

TL;DR

This paper introduces a multitask prompt distillation framework that efficiently adapts to various clinical NLP tasks using minimal trainable parameters, outperforming existing methods across multiple models and datasets.

Contribution

It proposes a shared metaprompt learning approach that significantly reduces parameter overhead while enhancing transferability and performance in clinical NLP tasks.

Findings

01

Outperforms LoRA by 1.5~1.7% with fewer parameters

02

Exceeds single-task prompt tuning by 6.1~6.6%

03

Achieves highest performance with gpt-oss 20B model

Abstract

Existing prompt-based fine-tuning methods typically learn task-specific prompts independently, imposing significant computing and storage overhead at scale when deploying multiple clinical natural language processing (NLP) systems. We present a multitask prompt distillation and decomposition framework that learns a single shared metaprompt from 21 diverse clinical source tasks and adapts it to unseen target tasks with fewer than 0.05% trainable parameters. Evaluated across five clinical NLP task types (named entity recognition, relation extraction, question answering, natural language inference, and summarization) on 10 held-out target datasets using three backbone models (LLaMA 3.1 8B, Meditron3 8B, gpt-oss 20B), our framework consistently outperforms LoRA by 1.5~1.7% despite using orders of magnitude fewer parameters, and exceeds single-task prompt tuning by 6.1~6.6%. The gpt-oss 20B…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.