Segment Anything

Alexander Kirillov; Eric Mintun; Nikhila Ravi; Hanzi Mao; Chloe; Rolland; Laura Gustafson; Tete Xiao; Spencer Whitehead; Alexander C. Berg,; Wan-Yen Lo; Piotr Doll\'ar; Ross Girshick

arXiv:2304.02643·cs.CV·April 6, 2023·528 cites

Segment Anything

Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe, Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg,, Wan-Yen Lo, Piotr Doll\'ar, Ross Girshick

PDF

Open Access 5 Repos 10 Models 5 Datasets

TL;DR

The paper presents the Segment Anything project, introducing a new image segmentation task, a large-scale dataset with over 1 billion masks, and a versatile promptable model that performs well in zero-shot settings across various tasks.

Contribution

It introduces the largest segmentation dataset to date and a promptable model capable of zero-shot transfer, advancing foundation models in computer vision.

Findings

01

Zero-shot performance is often competitive with or better than supervised methods.

02

The dataset contains over 1 billion masks on 11 million images.

03

The model demonstrates strong transferability across diverse segmentation tasks.

Abstract

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · COVID-19 diagnosis using AI

MethodsSegment Anything Model