Amodal Ground Truth and Completion in the Wild

Guanqi Zhan; Chuanxia Zheng; Weidi Xie; Andrew Zisserman

arXiv:2312.17247·cs.CV·April 30, 2024·1 cites

Amodal Ground Truth and Completion in the Wild

Guanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman

PDF

Open Access 1 Repo

TL;DR

This paper introduces an automatic pipeline using 3D data to generate authentic amodal segmentation ground truth for occluded objects in real images, creating a new benchmark and improving state-of-the-art performance.

Contribution

It presents a novel automatic method for amodal ground truth generation and a new benchmark dataset, advancing amodal segmentation in real-world scenarios.

Findings

01

Achieved state-of-the-art results on amodal segmentation datasets.

02

Developed two architecture variants for amodal completion.

03

Created MP3D-Amodal, a new diverse amodal segmentation benchmark.

Abstract

This paper studies amodal image segmentation: predicting entire object segmentation masks including both visible and invisible (occluded) parts. In previous work, the amodal segmentation ground truth on real images is usually predicted by manual annotaton and thus is subjective. In contrast, we use 3D data to establish an automatic pipeline to determine authentic ground truth amodal masks for partially occluded objects in real images. This pipeline is used to construct an amodal completion evaluation benchmark, MP3D-Amodal, consisting of a variety of object categories and labels. To better handle the amodal completion task in the wild, we explore two architecture variants: a two-stage model that first infers the occluder, followed by amodal mask completion; and a one-stage model that exploits the representation power of Stable Diffusion for amodal segmentation across many categories.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Championchess/Amodal-Completion-in-the-Wild
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsDiffusion