PVANET: Deep but Lightweight Neural Networks for Real-time Object   Detection

Kye-Hyeon Kim; Sanghoon Hong; Byungseok Roh; Yeongjae Cheon; Minje; Park

arXiv:1608.08021·cs.CV·October 3, 2016·202 cites

PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection

Kye-Hyeon Kim, Sanghoon Hong, Byungseok Roh, Yeongjae Cheon, Minje, Park

PDF

Open Access 2 Repos

TL;DR

PVANET is a deep, lightweight neural network designed for real-time multi-category object detection, achieving high accuracy with significantly reduced computational cost on standard benchmarks.

Contribution

The paper introduces a novel deep and thin network architecture that combines recent innovations to optimize accuracy and efficiency in object detection tasks.

Findings

01

83.8% mAP on VOC2007

02

82.5% mAP on VOC2012

03

750ms/image on CPU, 46ms/image on GPU

Abstract

This paper presents how we can achieve the state-of-the-art accuracy in multi-category object detection task while minimizing the computational cost by adapting and combining recent technical innovations. Following the common pipeline of "CNN feature extraction + region proposal + RoI classification", we mainly redesign the feature extraction part, since region proposal part is not computationally expensive and classification part can be efficiently compressed with common techniques like truncated SVD. Our design principle is "less channels with more layers" and adoption of some building blocks including concatenated ReLU, Inception, and HyperNet. The designed network is deep and thin and trained with the help of batch normalization, residual connections, and learning rate scheduling based on plateau detection. We obtained solid results on well-known object detection benchmarks: 83.8%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Video Surveillance and Tracking Methods

Methods*Communicated@Fast*How Do I Communicate to Expedia?