Monocular Object Instance Segmentation and Depth Ordering with CNNs
Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun

TL;DR
This paper presents a CNN-based method for simultaneous instance segmentation and depth ordering from a single image, using a Markov random field to unify patch-based predictions, achieving strong results on the KITTI benchmark.
Contribution
The paper introduces a novel CNN and MRF framework for joint instance segmentation and depth ordering from monocular images, improving accuracy on challenging datasets.
Findings
Effective joint segmentation and depth ordering achieved
Strong performance demonstrated on KITTI benchmark
Patch-based CNN predictions refined by MRF improve results
Abstract
In this paper we tackle the problem of instance-level segmentation and depth ordering from a single monocular image. Towards this goal, we take advantage of convolutional neural nets and train them to directly predict instance-level segmentations where the instance ID encodes the depth ordering within image patches. To provide a coherent single explanation of an image we develop a Markov random field which takes as input the predictions of convolutional neural nets applied at overlapping patches of different resolutions, as well as the output of a connected component algorithm. It aims to predict accurate instance-level segmentation and depth ordering. We demonstrate the effectiveness of our approach on the challenging KITTI benchmark and show good performance on both tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Neural Network Applications
