DOC: Deep OCclusion Estimation From a Single Image
Peng Wang, Alan Yuille

TL;DR
This paper introduces DOC, a deep neural network architecture that estimates occlusion relationships from a single image by detecting object boundaries and border ownership, utilizing a large-scale dataset for training and outperforming existing methods.
Contribution
The paper presents a novel deep network architecture for occlusion estimation from a single image and introduces a large-scale dataset for training and evaluation.
Findings
DOC outperforms state-of-the-art methods on PIOD and BSDS datasets.
The large-scale PIOD dataset significantly enhances occlusion boundary detection.
Experiments show effective transfer learning between datasets.
Abstract
Recovering the occlusion relationships between objects is a fundamental human visual ability which yields important information about the 3D world. In this paper we propose a deep network architecture, called DOC, which acts on a single image, detects object boundaries and estimates the border ownership (i.e. which side of the boundary is foreground and which is background). We represent occlusion relations by a binary edge map, to indicate the object boundary, and an occlusion orientation variable which is tangential to the boundary and whose direction specifies border ownership by a left-hand rule. We train two related deep convolutional neural networks, called DOC, which exploit local and non-local image cues to estimate this representation and hence recover occlusion relations. In order to train and test DOC we construct a large-scale instance occlusion boundary dataset using PASCAL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
