Memorizing SAM: 3D Medical Segment Anything Model with Memorizing Transformer
Xinyuan Shao, Yiqing Shen, and Mathias Unberath

TL;DR
This paper introduces Memorizing SAM, a 3D medical image segmentation model with a memory mechanism that enhances accuracy by recalling internal representations, outperforming existing models with minimal additional inference time.
Contribution
It proposes a novel memorizing Transformer plug-in for 3D SAM, improving segmentation accuracy in medical images with limited computational overhead.
Findings
Outperforms FastSAM3D with 11.36% higher Dice score
Increases inference time by only 4.38 milliseconds
Demonstrates effectiveness across 33 medical image categories
Abstract
Segment Anything Models (SAMs) have gained increasing attention in medical image analysis due to their zero-shot generalization capability in segmenting objects of unseen classes and domains when provided with appropriate user prompts. Addressing this performance gap is important to fully leverage the pre-trained weights of SAMs, particularly in the domain of volumetric medical image segmentation, where accuracy is important but well-annotated 3D medical data for fine-tuning is limited. In this work, we investigate whether introducing the memory mechanism as a plug-in, specifically the ability to memorize and recall internal representations of past inputs, can improve the performance of SAM with limited computation cost. To this end, we propose Memorizing SAM, a novel 3D SAM architecture incorporating a memory Transformer as a plug-in. Unlike conventional memorizing Transformers that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare · Big Data Technologies and Applications · Context-Aware Activity Recognition Systems
MethodsAttention Is All You Need · Linear Layer · Dropout · Multi-Head Attention · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Softmax
