From SAM to SAM 2: Exploring Improvements in Meta's Segment Anything   Model

Athulya Sundaresan Geetha; Muhammad Hussain

arXiv:2408.06305·cs.CV·August 13, 2024

From SAM to SAM 2: Exploring Improvements in Meta's Segment Anything Model

Athulya Sundaresan Geetha, Muhammad Hussain

PDF

Open Access

TL;DR

This paper discusses the evolution from SAM to SAM 2, highlighting improvements in video segmentation capabilities, real-time performance, and the potential impact on future computer vision applications.

Contribution

The paper introduces SAM 2, extending SAM's zero-shot segmentation to videos by utilizing temporal memory for improved accuracy and efficiency.

Findings

01

SAM 2 enables near real-time video segmentation.

02

Utilizes memory from adjacent frames for better accuracy.

03

Demonstrates significant improvements over original SAM.

Abstract

The Segment Anything Model (SAM), introduced to the computer vision community by Meta in April 2023, is a groundbreaking tool that allows automated segmentation of objects in images based on prompts such as text, clicks, or bounding boxes. SAM excels in zero-shot performance, segmenting unseen objects without additional training, stimulated by a large dataset of over one billion image masks. SAM 2 expands this functionality to video, leveraging memory from preceding and subsequent frames to generate accurate segmentation across entire videos, enabling near real-time performance. This comparison shows how SAM has evolved to meet the growing need for precise and efficient segmentation in various applications. The study suggests that future advancements in models like SAM will be crucial for improving computer vision technology.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Techniques and Practices · Big Data and Business Intelligence · Big Data Technologies and Applications

MethodsSegment Anything Model