SAM-I2I: Unleash the Power of Segment Anything Model for Medical Image Translation
Jiayu Huo, Sebastien Ourselin, Rachel Sparks

TL;DR
SAM-I2I leverages the Segment Anything Model 2 to improve medical image translation, capturing fine-grain semantic features for higher quality results compared to existing CNN and Transformer-based methods.
Contribution
The paper introduces SAM-I2I, a novel image-to-image translation framework using SAM2 for enhanced semantic feature extraction in medical imaging.
Findings
Outperforms state-of-the-art methods on MRI datasets
Provides more accurate and efficient image translation
Captures fine-grain semantic features effectively
Abstract
Medical image translation is crucial for reducing the need for redundant and expensive multi-modal imaging in clinical field. However, current approaches based on Convolutional Neural Networks (CNNs) and Transformers often fail to capture fine-grain semantic features, resulting in suboptimal image quality. To address this challenge, we propose SAM-I2I, a novel image-to-image translation framework based on the Segment Anything Model 2 (SAM2). SAM-I2I utilizes a pre-trained image encoder to extract multiscale semantic features from the source image and a decoder, based on the mask unit attention module, to synthesize target modality images. Our experiments on multi-contrast MRI datasets demonstrate that SAM-I2I outperforms state-of-the-art methods, offering more efficient and accurate medical image translation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Medical Imaging and Analysis · Brain Tumor Detection and Classification
MethodsSoftmax · Attention Is All You Need
