TAP-SLF: Parameter-Efficient Adaptation of Vision Foundation Models for Multi-Task Ultrasound Image Analysis
Hui Wan, Libin Lan

TL;DR
This paper introduces TAP-SLF, a parameter-efficient framework for multi-task ultrasound image analysis that uses task-aware prompts and selective layer fine-tuning to adapt vision foundation models effectively while reducing overfitting and computational costs.
Contribution
The paper proposes a novel unified approach combining task-aware soft prompts and selective high-layer fine-tuning for efficient multi-task medical image analysis with vision foundation models.
Findings
TAP-SLF achieves competitive results on the FMC_UIA 2026 Challenge.
Task-aware prompts improve task-specific adaptation.
Selective layer fine-tuning reduces computational costs.
Abstract
Executing multiple tasks simultaneously in medical image analysis, including segmentation, classification, detection, and regression, often introduces significant challenges regarding model generalizability and the optimization of shared feature representations. While Vision Foundation Models (VFMs) provide powerful general representations, full fine-tuning on limited medical data is prone to overfitting and incurs high computational costs. Moreover, existing parameter-efficient fine-tuning approaches typically adopt task-agnostic adaptation protocols, overlooking both task-specific mechanisms and the varying sensitivity of model layers during fine-tuning. In this work, we propose Task-Aware Prompting and Selective Layer Fine-Tuning (TAP-SLF), a unified framework for multi-task ultrasound image analysis. TAP-SLF incorporates task-aware soft prompts to encode task-specific priors into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Ultrasound Imaging and Elastography · Generative Adversarial Networks and Image Synthesis
