TL;DR
SAMed-2 introduces a novel foundation model for medical image segmentation that leverages a temporal encoder and confidence-driven memory to handle diverse modalities and noisy data, outperforming existing methods.
Contribution
The paper presents SAMed-2, a new medical segmentation model with a temporal adapter and memory mechanism, trained on the MedBank-100k dataset, advancing multi-task medical image segmentation.
Findings
Outperforms state-of-the-art baselines in multiple benchmarks.
Effectively handles diverse modalities and noisy annotations.
Mitigates catastrophic forgetting in continual learning scenarios.
Abstract
Recent "segment anything" efforts show promise by learning from large-scale data, but adapting such models directly to medical images remains challenging due to the complexity of medical data, noisy annotations, and continual learning requirements across diverse modalities and anatomical structures. In this work, we propose SAMed-2, a new foundation model for medical image segmentation built upon the SAM-2 architecture. Specifically, we introduce a temporal adapter into the image encoder to capture image correlations and a confidence-driven memory mechanism to store high-certainty features for later retrieval. This memory-based strategy counters the pervasive noise in large-scale medical datasets and mitigates catastrophic forgetting when encountering new tasks or modalities. To train and evaluate SAMed-2, we curate MedBank-100k, a comprehensive dataset spanning seven imaging modalities…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAdapter
