Deformable Part-based Fully Convolutional Network for Object Detection

Taylor Mordan; Nicolas Thome; Matthieu Cord; Gilles Henaff

arXiv:1707.06175·cs.CV·July 20, 2017

Deformable Part-based Fully Convolutional Network for Object Detection

Taylor Mordan, Nicolas Thome, Matthieu Cord, Gilles Henaff

PDF

Open Access

TL;DR

This paper introduces DP-FCN, a deformable part-based fully convolutional network that adapts to object shapes for improved detection accuracy, achieving state-of-the-art results on PASCAL VOC datasets.

Contribution

The paper presents a novel deformable part-based FCN architecture that learns to adapt to object shapes without extra annotations, enhancing detection and localization.

Findings

01

Achieves 83.1% mAP on PASCAL VOC 2007

02

Achieves 80.9% mAP on PASCAL VOC 2012

03

Significant performance improvements over previous methods

Abstract

Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those are highly non-rectangular. In this paper we introduce DP-FCN, a deep model for object detection which explicitly adapts to shapes of objects with deformable parts. Without additional annotations, it learns to focus on discriminative elements and to align them, and simultaneously brings more invariance for classification and geometric information to refine localization. DP-FCN is composed of three main modules: a Fully Convolutional Network to efficiently maintain spatial resolution, a deformable part-based RoI pooling layer to optimize positions of parts and build invariance, and a deformation-aware localization module explicitly exploiting displacements of parts to improve accuracy of bounding box regression. We experimentally validate our model and show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Face recognition and analysis · Advanced Image and Video Retrieval Techniques