From Cold Start to Active Learning: Embedding-Based Scan Selection for Medical Image Segmentation
Devon Levy, Bar Assayag, Laura Gaspar, Ilan Shimshoni, Bella Specktor-Fadida

TL;DR
This paper introduces a novel embedding-based cold-start sampling method combined with an uncertainty-driven active learning framework to improve medical image segmentation, especially in low-data regimes.
Contribution
It proposes a new cold-start sampling strategy using foundation-model embeddings and clustering, enhancing initial training diversity and representation.
Findings
Outperforms random selection in cold-start phase across datasets.
Improves segmentation accuracy and reduces Hausdorff distance in active learning.
Consistently outperforms baseline methods in low-data regimes.
Abstract
Accurate segmentation annotations are critical for disease monitoring, yet manual labeling remains a major bottleneck due to the time and expertise required. Active learning (AL) alleviates this burden by prioritizing informative samples for annotation, typically through a diversity-based cold-start phase followed by uncertainty-driven selection. We propose a novel cold-start sampling strategy that combines foundation-model embeddings with clustering, including automatic selection of the number of clusters and proportional sampling across clusters, to construct a diverse and representative initial training. This is followed by an uncertainty-based AL framework that integrates spatial diversity to guide sample selection. The proposed method is intuitive and interpretable, enabling visualization of the feature-space distribution of candidate samples. We evaluate our approach on three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
