Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation

Patrick Follmann; Rebecca König; Philipp Härtinger; Michael Klostermann

arXiv:1804.08864·cs.CV·April 25, 2018

TL;DR

This paper introduces the first all-in-one, end-to-end trainable model for semantic amodal segmentation. It predicts each instance's amodal mask together with its visible and invisible parts in a single forward pass, outperforms the current baseline on the COCO amodal dataset, and provides baseline results on two new datasets.

Contribution

The authors present an all-in-one model for semantic amodal segmentation, two new datasets with amodal ground truth (D2S amodal and COCOA cls), and data augmentation techniques aimed at good performance without extensive amodal training data.

Findings

01

Model outperforms current baseline on COCO amodal dataset

02

Provides strong baseline results on new D2S amodal and COCOA cls datasets

03

Achieves reasonable amodal segmentation performance with data augmentation

Abstract

Semantic amodal segmentation is a recently proposed extension to instance-aware segmentation that includes the prediction of the invisible region of each object instance. We present the first all-in-one end-to-end trainable model for semantic amodal segmentation that predicts the amodal instance masks as well as their visible and invisible part in a single forward pass. In a detailed analysis, we provide experiments to show which architecture choices are beneficial for an all-in-one amodal segmentation model. On the COCO amodal dataset, our model outperforms the current baseline for amodal segmentation by a large margin. To further evaluate our model, we provide two new datasets with ground truth for semantic amodal segmentation, D2S amodal and COCOA cls. For both datasets, our model provides a strong baseline performance. Using special data augmentation techniques, we show that amodal…
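The relationship between the three masks named in the abstract can be illustrated with a small sketch. It is not the authors' model or code: it only assumes that, for one instance, the amodal mask is the union of a visible part and an invisible (occluded) part, and the function names `split_amodal_mask` and `occlusion_rate` are hypothetical helpers introduced here for illustration.

```python
import numpy as np

def split_amodal_mask(amodal, visible):
    """Return (visible, invisible) boolean masks for a single instance.

    Assumption: the amodal mask covers the whole object, and the visible
    mask marks the unoccluded pixels of that same object.
    """
    amodal = np.asarray(amodal, dtype=bool)
    # Clip the visible mask to the amodal mask so the two parts partition it.
    visible = np.asarray(visible, dtype=bool) & amodal
    # The invisible part is whatever the amodal mask covers beyond the visible part.
    invisible = amodal & ~visible
    return visible, invisible

def occlusion_rate(amodal, visible):
    """Fraction of the amodal mask that is occluded (0.0 for an empty mask)."""
    amodal = np.asarray(amodal, dtype=bool)
    _, invisible = split_amodal_mask(amodal, visible)
    area = int(amodal.sum())
    return float(invisible.sum()) / area if area else 0.0

# Toy example: a 2x3 object whose rightmost column is hidden by an occluder.
amodal = np.array([[0, 1, 1, 1],
                   [0, 1, 1, 1],
                   [0, 0, 0, 0]], dtype=bool)
visible = np.array([[0, 1, 1, 0],
                    [0, 1, 1, 0],
                    [0, 0, 0, 0]], dtype=bool)

vis, invis = split_amodal_mask(amodal, visible)
rate = occlusion_rate(amodal, visible)
```

In this toy case the object covers six pixels, four of which are visible, so the invisible part holds the remaining two pixels. A model of the kind described above would predict all three masks directly rather than deriving one from the others.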
