TL;DR
This paper introduces the first end-to-end trainable model for semantic amodal segmentation, capable of predicting visible and invisible object regions in a single pass, outperforming existing baselines on multiple datasets.
Contribution
The authors present a novel all-in-one model for semantic amodal segmentation, including new datasets and data augmentation techniques to enhance performance without extensive amodal training data.
Findings
Model outperforms current baseline on COCO amodal dataset
Provides strong baseline results on new D2S amodal and COCOA cls datasets
Achieves reasonable amodal segmentation performance with data augmentation
Abstract
Semantic amodal segmentation is a recently proposed extension to instance-aware segmentation that includes the prediction of the invisible region of each object instance. We present the first all-in-one end-to-end trainable model for semantic amodal segmentation that predicts the amodal instance masks as well as their visible and invisible part in a single forward pass. In a detailed analysis, we provide experiments to show which architecture choices are beneficial for an all-in-one amodal segmentation model. On the COCO amodal dataset, our model outperforms the current baseline for amodal segmentation by a large margin. To further evaluate our model, we provide two new datasets with ground truth for semantic amodal segmentation, D2S amodal and COCOA cls. For both datasets, our model provides a strong baseline performance. Using special data augmentation techniques, we show that amodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
