Monocular Object Instance Segmentation and Depth Ordering with CNNs

Ziyu Zhang; Alexander G. Schwing; Sanja Fidler; Raquel Urtasun

arXiv:1505.03159 · cs.CV · December 21, 2015 · 34 cites


TL;DR

This paper presents a CNN-based method for simultaneous instance segmentation and depth ordering from a single image, using a Markov random field to unify patch-based predictions, achieving strong results on the KITTI benchmark.

Contribution

The paper introduces a combined CNN and MRF framework for joint instance segmentation and depth ordering from monocular images, and reports good performance on both tasks on the KITTI benchmark.

Findings

01 Effective joint segmentation and depth ordering achieved

02 Strong performance demonstrated on KITTI benchmark

03 Patch-based CNN predictions refined by MRF improve results

Abstract

In this paper we tackle the problem of instance-level segmentation and depth ordering from a single monocular image. Towards this goal, we take advantage of convolutional neural nets and train them to directly predict instance-level segmentations where the instance ID encodes the depth ordering within image patches. To provide a coherent single explanation of an image we develop a Markov random field which takes as input the predictions of convolutional neural nets applied at overlapping patches of different resolutions, as well as the output of a connected component algorithm. It aims to predict accurate instance-level segmentation and depth ordering. We demonstrate the effectiveness of our approach on the challenging KITTI benchmark and show good performance on both tasks.
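The idea that "the instance ID encodes the depth ordering within image patches" can be illustrated with a small sketch. This is not the authors' code: the function name `encode_patch_labels`, the per-instance depth dictionary, and the convention that ID 1 marks the closest instance in the patch are all assumptions made here for illustration.

```python
import numpy as np

def encode_patch_labels(instance_mask, instance_depth, top, left, size):
    """Relabel the instances inside a square patch by their depth rank.

    instance_mask:  2D int array, 0 = background, k > 0 = global instance ID.
    instance_depth: dict mapping global instance ID -> representative depth
                    (hypothetical input; smaller means closer to the camera).
    Returns a patch where 0 = background, 1 = closest instance visible in
    the patch, 2 = next closest, and so on (assumed convention).
    """
    patch = instance_mask[top:top + size, left:left + size]
    # Global IDs present in this patch, excluding background.
    ids = [int(i) for i in np.unique(patch) if i != 0]
    # Order them from closest to farthest.
    ids.sort(key=lambda i: instance_depth[i])
    encoded = np.zeros_like(patch)
    for rank, gid in enumerate(ids, start=1):
        encoded[patch == gid] = rank
    return encoded

# Toy example: three instances, only two of which fall inside the patch.
mask = np.array([[0, 7, 7, 0],
                 [3, 3, 7, 0],
                 [3, 3, 0, 0],
                 [0, 0, 0, 5]])
depth = {3: 12.0, 7: 30.0, 5: 8.0}
patch_labels = encode_patch_labels(mask, depth, top=0, left=0, size=3)
```

Because labels are ranks local to each patch, the same object can receive different IDs in overlapping patches, which is why a global model such as the MRF described in the abstract is needed to merge them into a single explanation of the image.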


Taxonomy

Topics: Advanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Neural Network Applications