SemiSAM-O1: How far can we push the boundary of annotation-efficient medical image segmentation?

Yichi Zhang; Le Xue; Bichun Xu; Judong Luo; Zhigang Wu; Yu Fu; Zixin Hu; Yuan Cheng; Yuan Qi

arXiv:2604.24109·cs.CV·April 28, 2026

SemiSAM-O1: How far can we push the boundary of annotation-efficient medical image segmentation?

Yichi Zhang, Le Xue, Bichun Xu, Judong Luo, Zhigang Wu, Yu Fu, Zixin Hu, Yuan Cheng, Yuan Qi

PDF

TL;DR

SemiSAM-O1 introduces a novel one-annotated-template framework for medical image segmentation, leveraging foundation models and iterative refinement to achieve near full-supervision performance with minimal annotation.

Contribution

It extends foundation model-driven semi-supervised learning to the extreme one-label setting, combining feature-based pseudo-labeling and uncertainty-guided refinement.

Findings

01

SemiSAM-O1 narrows the performance gap to fully supervised models.

02

It reduces computational overhead compared to online foundation model inference.

03

The method is effective across diverse modalities and anatomical targets.

Abstract

Semi-supervised learning (SSL) has become a promising solution to alleviate the annotation burden of deep learning-based medical image segmentation models. While recent advances in foundation model-driven SSL have pushed the boundary to extremely limited annotation scenarios, they fail to maintain robust competitive performance in complex imaging modalities. In this paper, we propose SemiSAM-O1, an annotation-efficient framework using only one annotated template image for segmentation. SemiSAM-O1 extends the specialist-generalist collaborative learning framework to the extreme one-label setting by fully exploiting the foundation model's feature representation capability beyond its prompting interface. SemiSAM-O1 operates in two stages. In the first stage, the foundation model's encoder extracts dense features from all volumes, and class prototypes derived from the single annotated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.