T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
Raza Imam, Hu Wang, Dwarikanath Mahapatra, Mohammad Yaqub

TL;DR
This paper introduces T^3, a dynamic, test-time model merging method for vision-language models in medical imaging that adapts to each sample, improving accuracy and robustness across diverse modalities without extra training.
Contribution
The paper proposes T^3, a novel, sample-adaptive merging framework for medical vision-language models that enhances performance and reliability in clinical tasks.
Findings
Sets new state-of-the-art accuracy in medical imaging tasks.
Efficient batch-wise extension reduces computational costs.
Demonstrates robustness across multiple medical imaging modalities.
Abstract
In medical imaging, vision-language models face a critical duality: pretrained networks offer broad robustness but lack subtle, modality-specific characteristics, while fine-tuned expert models achieve high in-distribution accuracy yet falter under modality shift. Existing model-merging techniques, designed for natural-image benchmarks, are simple and efficient but fail to deliver consistent gains across diverse medical modalities; their static interpolation limits reliability in varied clinical tasks. To address this, we introduce Test-Time Task adaptive merging (T^3), a backpropagation-free framework that computes per-sample interpolation coefficients via the Jensen-Shannon divergence between the two models' output distributions. T^3 dynamically preserves local precision when models agree and defers to generalist robustness under drift. To overcome the inference costs of sample-wise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
