Robust Instance Segmentation through Reasoning about Multi-Object Occlusion
Xiaoding Yuan, Adam Kortylewski, Yihong Sun, Alan Yuille

TL;DR
This paper introduces a deep neural network that improves multi-object instance segmentation under occlusion by reasoning about object relationships and occlusion order, trained with only bounding box supervision.
Contribution
It extends compositional networks to handle multiple objects and incorporates an Occlusion Reasoning Module for robust segmentation in occluded scenes.
Findings
Effective segmentation under occlusion demonstrated on KITTI and synthetic datasets.
Outperforms existing methods in occlusion robustness.
Provides a framework for occlusion-aware scene understanding.
Abstract
Analyzing complex scenes with Deep Neural Networks is a challenging task, particularly when images contain multiple objects that partially occlude each other. Existing approaches to image analysis mostly process objects independently and do not take into account the relative occlusion of nearby objects. In this paper, we propose a deep network for multi-object instance segmentation that is robust to occlusion and can be trained from bounding box supervision only. Our work builds on Compositional Networks, which learn a generative model of neural feature activations to locate occluders and to classify objects based on their non-occluded parts. We extend their generative model to include multiple objects and introduce a framework for efficient inference in challenging occlusion scenarios. In particular, we obtain feed-forward predictions of the object classes and their instance and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
