From Cold Start to Active Learning: Embedding-Based Scan Selection for Medical Image Segmentation

Devon Levy; Bar Assayag; Laura Gaspar; Ilan Shimshoni; Bella Specktor-Fadida

arXiv:2601.18532·cs.CV·February 2, 2026

From Cold Start to Active Learning: Embedding-Based Scan Selection for Medical Image Segmentation

Devon Levy, Bar Assayag, Laura Gaspar, Ilan Shimshoni, Bella Specktor-Fadida

PDF

Open Access

TL;DR

This paper introduces a novel embedding-based cold-start sampling method combined with an uncertainty-driven active learning framework to improve medical image segmentation, especially in low-data regimes.

Contribution

It proposes a new cold-start sampling strategy using foundation-model embeddings and clustering, enhancing initial training diversity and representation.

Findings

01

Outperforms random selection in cold-start phase across datasets.

02

Improves segmentation accuracy and reduces Hausdorff distance in active learning.

03

Consistently outperforms baseline methods in low-data regimes.

Abstract

Accurate segmentation annotations are critical for disease monitoring, yet manual labeling remains a major bottleneck due to the time and expertise required. Active learning (AL) alleviates this burden by prioritizing informative samples for annotation, typically through a diversity-based cold-start phase followed by uncertainty-driven selection. We propose a novel cold-start sampling strategy that combines foundation-model embeddings with clustering, including automatic selection of the number of clusters and proportional sampling across clusters, to construct a diverse and representative initial training. This is followed by an uncertainty-based AL framework that integrates spatial diversity to guide sample selection. The proposed method is intuitive and interpretable, enabling visualization of the feature-space distribution of candidate samples. We evaluate our approach on three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning