Object detection via a multi-region & semantic segmentation-aware CNN   model

Spyros Gidaris; Nikos Komodakis

arXiv:1505.01749·cs.CV·September 25, 2015

Object detection via a multi-region & semantic segmentation-aware CNN model

Spyros Gidaris, Nikos Komodakis

PDF

1 Repo

TL;DR

This paper introduces a multi-region and semantic segmentation-aware CNN for object detection, achieving high localization accuracy and surpassing previous methods on PASCAL VOC datasets.

Contribution

The paper presents a novel CNN-based object detection system that combines multi-region features with semantic segmentation awareness and an iterative localization mechanism.

Findings

01

Achieved 78.2% mAP on PASCAL VOC2007

02

Achieved 73.9% mAP on PASCAL VOC2012

03

Surpassed previous state-of-the-art results significantly

Abstract

We propose an object detection system that relies on a multi-region deep convolutional neural network (CNN) that also encodes semantic segmentation-aware features. The resulting CNN-based representation aims at capturing a diverse set of discriminative appearance factors and exhibits localization sensitivity that is essential for accurate object localization. We exploit the above properties of our recognition module by integrating it on an iterative localization mechanism that alternates between scoring a box proposal and refining its location with a deep CNN regression model. Thanks to the efficient use of our modules, we detect objects with very high localization accuracy. On the detection challenges of PASCAL VOC2007 and PASCAL VOC2012 we achieve mAP of 78.2% and 73.9% correspondingly, surpassing any other published work by a significant margin.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gidariss/mrcnn-object-detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.