SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

Zhiling Yan; Sifan Song; Dingjie Song; Yiwei Li; Rong Zhou; Weixiang Sun; Zhennong Chen; Sekeun Kim; Hui Ren; Tianming Liu; Quanzheng Li; Xiang Li; Lifang He; and Lichao Sun

arXiv:2507.03698·cs.CV·July 8, 2025

SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

Zhiling Yan, Sifan Song, Dingjie Song, Yiwei Li, Rong Zhou, Weixiang Sun, Zhennong Chen, Sekeun Kim, Hui Ren, Tianming Liu, Quanzheng Li, Xiang Li, Lifang He, and Lichao Sun

PDF

1 Repo

TL;DR

SAMed-2 introduces a novel foundation model for medical image segmentation that leverages a temporal encoder and confidence-driven memory to handle diverse modalities and noisy data, outperforming existing methods.

Contribution

The paper presents SAMed-2, a new medical segmentation model with a temporal adapter and memory mechanism, trained on the MedBank-100k dataset, advancing multi-task medical image segmentation.

Findings

01

Outperforms state-of-the-art baselines in multiple benchmarks.

02

Effectively handles diverse modalities and noisy annotations.

03

Mitigates catastrophic forgetting in continual learning scenarios.

Abstract

Recent "segment anything" efforts show promise by learning from large-scale data, but adapting such models directly to medical images remains challenging due to the complexity of medical data, noisy annotations, and continual learning requirements across diverse modalities and anatomical structures. In this work, we propose SAMed-2, a new foundation model for medical image segmentation built upon the SAM-2 architecture. Specifically, we introduce a temporal adapter into the image encoder to capture image correlations and a confidence-driven memory mechanism to store high-certainty features for later retrieval. This memory-based strategy counters the pervasive noise in large-scale medical datasets and mitigates catastrophic forgetting when encountering new tasks or modalities. To train and evaluate SAMed-2, we curate MedBank-100k, a comprehensive dataset spanning seven imaging modalities…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhilingyan/medical-sam-bench
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAdapter