SD-FSMIS: Adapting Stable Diffusion for Few-Shot Medical Image Segmentation

Meihua Li; Yang Zhang; Weizhao He; Hu Qu; Yisong Li

arXiv:2604.03134·cs.CV·April 10, 2026

SD-FSMIS: Adapting Stable Diffusion for Few-Shot Medical Image Segmentation

Meihua Li, Yang Zhang, Weizhao He, Hu Qu, Yisong Li

PDF

TL;DR

This paper introduces SD-FSMIS, a novel method that adapts the pre-trained Stable Diffusion model for few-shot medical image segmentation, leveraging visual priors for improved data efficiency and robustness.

Contribution

It proposes a new framework with support-query interaction and visual-to-textual translation modules to effectively adapt diffusion models for FSMIS tasks.

Findings

01

Achieves competitive results on standard FSMIS benchmarks.

02

Demonstrates strong generalization in cross-domain scenarios.

03

Highlights the potential of large-scale generative models in medical segmentation.

Abstract

Few-Shot Medical Image Segmentation (FSMIS) aims to segment novel object classes in medical images using only minimal annotated examples, addressing the critical challenges of data scarcity and domain shifts prevalent in medical imaging. While Diffusion Models (DM) excel in visual tasks, their potential for FSMIS remains largely unexplored. We propose that the rich visual priors learned by large-scale DMs offer a powerful foundation for a more robust and data-efficient segmentation approach. In this paper, we introduce SD-FSMIS, a novel framework designed to effectively adapt the powerful pre-trained Stable Diffusion (SD) model for the FSMIS task. Our approach repurposes its conditional generative architecture by introducing two key components: a Support-Query Interaction (SQI) and a Visual-to-Textual Condition Translator (VTCT). Specifically, SQI provides a straightforward yet powerful…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.