segDeepM: Exploiting Segmentation and Context in Deep Neural Networks   for Object Detection

Yukun Zhu; Raquel Urtasun; Ruslan Salakhutdinov; Sanja Fidler

arXiv:1502.04275·cs.CV·February 17, 2015·52 cites

segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection

Yukun Zhu, Raquel Urtasun, Ruslan Salakhutdinov, Sanja Fidler

PDF

Open Access

TL;DR

This paper introduces segDeepM, a method that enhances object detection accuracy by integrating segmentation and contextual cues within a Markov Random Field framework, leveraging CNNs to select and score segmentation proposals.

Contribution

It presents a novel approach combining segmentation and context in deep neural networks for improved object detection accuracy.

Findings

01

Achieved 4.1% improvement in mAP over R-CNN on PASCAL VOC 2010

02

Surpassed current state-of-the-art by 3.4% in mAP

03

Demonstrated the effectiveness of segmentation and context integration

Abstract

In this paper, we propose an approach that exploits object segmentation in order to improve the accuracy of object detection. We frame the problem as inference in a Markov Random Field, in which each detection hypothesis scores object appearance as well as contextual information using Convolutional Neural Networks, and allows the hypothesis to choose and score a segment out of a large pool of accurate object segmentation proposals. This enables the detector to incorporate additional evidence when it is available and thus results in more accurate detections. Our experiments show an improvement of 4.1% in mAP over the R-CNN baseline on PASCAL VOC 2010, and 3.4% over the current state-of-the-art, demonstrating the power of our approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

MethodsSupport Vector Machine · Max Pooling · Convolution · R-CNN