Amodal SAM: A Unified Amodal Segmentation Framework with Generalization

Bo Zhang; Zhuotao Tian; Xin Tao; Songlin Tang; Jun Yu; Wenjie Pei

arXiv:2604.20748·cs.CV·April 23, 2026

Amodal SAM: A Unified Amodal Segmentation Framework with Generalization

Bo Zhang, Zhuotao Tian, Xin Tao, Songlin Tang, Jun Yu, Wenjie Pei

PDF

TL;DR

Amodal SAM is a unified framework that enhances generalization in amodal segmentation by leveraging SAM, synthetic data generation, and novel learning objectives, achieving state-of-the-art results.

Contribution

It introduces a comprehensive amodal segmentation framework combining SAM with new modules for occlusion handling and data synthesis, improving generalization and performance.

Findings

01

Achieves state-of-the-art performance on standard benchmarks.

02

Demonstrates robust generalization to unseen object categories.

03

Effective in both image and video amodal segmentation.

Abstract

Amodal segmentation is a challenging task that aims to predict the complete geometric shape of objects, including their occluded regions. Although existing methods primarily focus on amodal segmentation within the training domain, these approaches often lack the generalization capacity to extend effectively to novel object categories and unseen contexts. This paper introduces Amodal SAM, a unified framework that leverages SAM (Segment Anything Model) for both amodal image and amodal video segmentation. Amodal SAM preserves the powerful generalization ability of SAM while extending its inherent capabilities to the amodal segmentation task. The improvements lie in three aspects: (1) a lightweight Spatial Completion Adapter that enables occluded region reconstruction, (2) a Target-Aware Occlusion Synthesis (TAOS) pipeline that addresses the scarcity of amodal annotations by generating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.