Towards Better Ultrasound Video Segmentation Foundation Model: An Empirical study on SAM2 Finetuning from Data Perspective
Xing Yao, Ahana Gangopadhyay, Hsi-Ming Chang, Ravi Soni

TL;DR
This study systematically investigates how data characteristics like size, duration, and augmentation influence the adaptation of SAM2 foundation model for ultrasound video segmentation, emphasizing data-centric strategies over architectural changes.
Contribution
It provides a comprehensive analysis of data factors affecting SAM2 adaptation in ultrasound, highlighting the importance of data scale and temporal context over model architecture.
Findings
Data scale and temporal context are more influential than architecture.
Joint training balances modality alignment and task specificity.
Ultrasound-specific augmentations improve segmentation performance.
Abstract
Ultrasound (US) video segmentation remains a challenging problem due to strong inter- and intra-dataset variability, motion artifacts, and limited annotated data. Although foundation models such as Segment Anything Model 2 (SAM2) demonstrate strong zero-shot and prompt-guided segmentation capabilities, their performance deteriorates substantially when transferred to medical imaging domains. Current adaptation studies mainly emphasize architectural modifications, while the influence of data characteristics and training regimes has not been systematically examined. In this study, we present a comprehensive, data-centric investigation of SAM2 adaptation for ultrasound video segmentation. We analyze how training-set size, video duration, and augmentation schemes affect adaptation performance under three paradigms: task-specific fine-tuning, intermediate adaptation, and multi-task joint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
