Segment Anything
Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe, Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg,, Wan-Yen Lo, Piotr Doll\'ar, Ross Girshick

TL;DR
The paper presents the Segment Anything project, introducing a new image segmentation task, a large-scale dataset with over 1 billion masks, and a versatile promptable model that performs well in zero-shot settings across various tasks.
Contribution
It introduces the largest segmentation dataset to date and a promptable model capable of zero-shot transfer, advancing foundation models in computer vision.
Findings
Zero-shot performance is often competitive with or better than supervised methods.
The dataset contains over 1 billion masks on 11 million images.
The model demonstrates strong transferability across diverse segmentation tasks.
Abstract
We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗facebook/sam-vit-basemodel· 281k dl· ♡ 164281k dl♡ 164
- 🤗ybelkada/segment-anythingmodel· ♡ 123♡ 123
- 🤗facebook/sam-vit-hugemodel· 194k dl· ♡ 192194k dl♡ 192
- 🤗facebook/sam-vit-largemodel· 12k dl· ♡ 3312k dl♡ 33
- 🤗wanglab/medsam-vit-basemodel· 2.0k dl· ♡ 242.0k dl♡ 24
- 🤗timm/samvit_base_patch16.sa1bmodel· 34k dl· ♡ 134k dl♡ 1
- 🤗timm/samvit_huge_patch16.sa1bmodel· 108 dl· ♡ 1108 dl♡ 1
- 🤗timm/samvit_large_patch16.sa1bmodel· 168 dl168 dl
- 🤗dhkim2810/MobileSAMmodel· ♡ 27♡ 27
- 🤗giladvdn/test-sam-handlermodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · COVID-19 diagnosis using AI
MethodsSegment Anything Model
