AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt   Encoder

Tal Shaharabany; Aviad Dahan; Raja Giryes; Lior Wolf

arXiv:2306.06370·cs.CV·June 13, 2023·39 cites

AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder

Tal Shaharabany, Aviad Dahan, Raja Giryes, Lior Wolf

PDF

Open Access

TL;DR

AutoSAM adapts the Segment Anything Model for medical images by replacing its conditioning mechanism with an image encoder, achieving state-of-the-art results without additional fine-tuning.

Contribution

The paper introduces a new encoder that replaces SAM's conditioning, enabling effective medical image segmentation without fine-tuning SAM.

Findings

01

Achieves state-of-the-art results on medical image benchmarks.

02

Enables fully automatic segmentation without manual prompts.

03

Operates effectively without fine-tuning SAM.

Abstract

The recently introduced Segment Anything Model (SAM) combines a clever architecture and large quantities of training data to obtain remarkable image segmentation capabilities. However, it fails to reproduce such results for Out-Of-Distribution (OOD) domains such as medical images. Moreover, while SAM is conditioned on either a mask or a set of points, it may be desirable to have a fully automatic solution. In this work, we replace SAM's conditioning with an encoder that operates on the same input image. By adding this encoder and without further fine-tuning SAM, we obtain state-of-the-art results on multiple medical images and video benchmarks. This new encoder is trained via gradients provided by a frozen SAM. For inspecting the knowledge within it, and providing a lightweight segmentation solution, we also learn to decode it into a mask by a shallow deconvolution network.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · COVID-19 diagnosis using AI

MethodsSegment Anything Model