From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes

Fariba Afrin Irany; Sampson Akwafuo

arXiv:2601.21955·cs.CL·March 3, 2026

From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes

Fariba Afrin Irany, Sampson Akwafuo

PDF

Open Access

TL;DR

This paper introduces a GPT-based architecture with selective fine-tuning for efficient and accurate clinical text classification in EHR notes, addressing challenges like limited labeled data and class imbalance.

Contribution

The study proposes a novel selective fine-tuning strategy for GPT-2 that reduces training complexity while maintaining high performance on clinical text classification tasks.

Findings

01

Stable convergence across dataset sizes

02

Strong performance in non-mention and negated findings

03

Effective adaptation with reduced computational cost

Abstract

The increasing availability of unstructured clinical narratives in electronic health records (EHRs) has created new opportunities for automated disease characterization, cohort identification, and clinical decision support. However, modeling long, domain-specific clinical text remains challenging due to limited labeled data, severe class imbalance, and the high computational cost of adapting large pretrained language models. This study presents a GPT-based architecture for clinical text classification that adapts a pretrained decoder-only Transformer using a selective fine-tuning strategy. Rather than updating all model parameters, the majority of the GPT-2 backbone is frozen, and training is restricted to the final Transformer block, the final layer normalization, and a lightweight classification head. This approach substantially reduces the number of trainable parameters while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Topic Modeling · Artificial Intelligence in Healthcare and Education