Training-Free Adaptation of New-Generation LLMs using Legacy Clinical Models

Sasha Ronaghi; Chloe Stanwyck; Asad Aali; Amir Ronaghi; Miguel Fuentes; Tina Hernandez-Boussard; Emily Alsentzer

arXiv:2601.03423·cs.CL·April 30, 2026

Training-Free Adaptation of New-Generation LLMs using Legacy Clinical Models

Sasha Ronaghi, Chloe Stanwyck, Asad Aali, Amir Ronaghi, Miguel Fuentes, Tina Hernandez-Boussard, Emily Alsentzer

PDF

TL;DR

The paper introduces CAPT, a training-free method for adapting new-generation language models to clinical tasks by ensembling with legacy models, improving clinical relevance without retraining.

Contribution

CAPT enables training-free adaptation of modern language models to clinical domains using existing models, outperforming previous ensembling methods across multiple tasks.

Findings

01

CAPT outperforms individual models and state-of-the-art ensembling methods by large margins.

02

CAPT enhances clinical specificity and reduces errors in language generation.

03

CAPT benefits healthcare settings with limited computational resources.

Abstract

Adapting language models to the clinical domain through continued pretraining and instruction tuning requires costly retraining for each new model generation. We propose Cross-Architecture Proxy Tuning (CAPT), a model-ensembling approach that enables training-free adaptation of state-of-the-art general-domain models using existing clinical models. CAPT supports models with disjoint vocabularies, leveraging contrastive decoding to selectively inject clinically relevant signals while preserving the general-domain model's reasoning and fluency. On six clinical classification and text-generation tasks, CAPT with a new-generation general-domain model and an older-generation clinical model consistently outperforms both models individually and state-of-the-art ensembling approaches (average +17.6\% over UniTE, +41.4\% over proxy tuning across tasks). Through token-level analysis and physician…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.