CU-ICU: Customizing Unsupervised Instruction-Finetuned Language Models for ICU Datasets via Text-to-Text Transfer Transformer
Teerapong Panboonyuen

TL;DR
CU-ICU introduces a sparse fine-tuning method for adapting large language models to ICU healthcare data, significantly improving clinical prediction accuracy and interpretability with minimal parameter updates.
Contribution
It presents CU-ICU, a novel sparse fine-tuning approach that efficiently customizes instruction-tuned language models for ICU datasets using minimal supervision.
Findings
Up to 15% increase in sepsis detection accuracy
20% improvement in generating clinical explanations
Fewer than 1% model parameters updated
Abstract
Integrating large language models into specialized domains like healthcare presents unique challenges, including domain adaptation and limited labeled data. We introduce CU-ICU, a method for customizing unsupervised instruction-finetuned language models for ICU datasets by leveraging the Text-to-Text Transfer Transformer (T5) architecture. CU-ICU employs a sparse fine-tuning approach that combines few-shot prompting with selective parameter updates, enabling efficient adaptation with minimal supervision. Our evaluation across critical ICU tasks--early sepsis detection, mortality prediction, and clinical note generation--demonstrates that CU-ICU consistently improves predictive accuracy and interpretability over standard fine-tuning methods. Notably, CU-ICU achieves up to a 15% increase in sepsis detection accuracy and a 20% enhancement in generating clinically relevant explanations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Sepsis Diagnosis and Treatment
