Fast R-CNN

Ross Girshick

arXiv:1504.08083 · cs.CV · September 29, 2015 · 278 cites

TL;DR

Fast R-CNN is a region-based convolutional network method for object detection. With the VGG16 network it trains 9x faster than R-CNN, runs 213x faster at test time, and achieves a higher mAP on PASCAL VOC 2012.

Contribution

Fast R-CNN speeds up both training and testing of deep object detectors while improving detection accuracy over R-CNN and SPPnet. Compared to SPPnet, it trains VGG16 3x faster and tests 10x faster.

Findings

01 Training VGG16 9x faster than R-CNN

02 Testing speed 213x faster than R-CNN

03 Higher mAP on PASCAL VOC 2012

Abstract

This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection. Fast R-CNN builds on previous work to efficiently classify object proposals using deep convolutional networks. Compared to previous work, Fast R-CNN employs several innovations to improve training and testing speed while also increasing detection accuracy. Fast R-CNN trains the very deep VGG16 network 9x faster than R-CNN, is 213x faster at test-time, and achieves a higher mAP on PASCAL VOC 2012. Compared to SPPnet, Fast R-CNN trains VGG16 3x faster, tests 10x faster, and is more accurate. Fast R-CNN is implemented in Python and C++ (using Caffe) and is available under the open-source MIT License at https://github.com/rbgirshick/fast-rcnn.
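As a rough illustration of the RoI pooling step listed under Methods (RoIPool), the sketch below max-pools one region of interest on a single-channel feature map into a fixed-size grid, so that proposals of different sizes yield outputs of the same shape. It is a simplified pure-Python sketch, not the paper's Caffe implementation. The names `roi_max_pool` and `ceil_div`, the inclusive `(r0, c0, r1, c1)` RoI format, and the 2x2 default output size are assumptions chosen for illustration.

```python
from typing import List, Tuple


def ceil_div(a: int, b: int) -> int:
    """Integer ceiling division for non-negative a and positive b."""
    return -(-a // b)


def roi_max_pool(
    feature_map: List[List[float]],
    roi: Tuple[int, int, int, int],
    out_h: int = 2,
    out_w: int = 2,
) -> List[List[float]]:
    """Max-pool one RoI into an out_h x out_w grid.

    roi is (r0, c0, r1, c1): inclusive row/column bounds given in
    feature-map coordinates (a hypothetical format for this sketch).
    """
    r0, c0, r1, c1 = roi
    roi_h = r1 - r0 + 1
    roi_w = c1 - c0 + 1

    pooled = []
    for i in range(out_h):
        # Floor for the bin start and ceiling for the bin end, so the
        # bins jointly cover the RoI and no bin is ever empty.
        rs = r0 + (i * roi_h) // out_h
        re = r0 + ceil_div((i + 1) * roi_h, out_h)
        row = []
        for j in range(out_w):
            cs = c0 + (j * roi_w) // out_w
            ce = c0 + ceil_div((j + 1) * roi_w, out_w)
            # Take the maximum activation inside this bin.
            row.append(
                max(
                    feature_map[r][c]
                    for r in range(rs, re)
                    for c in range(cs, ce)
                )
            )
        pooled.append(row)
    return pooled
```

For the 4x4 map `[[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]`, pooling the whole map with `roi=(0, 0, 3, 3)` gives `[[6, 8], [14, 16]]`, and the smaller RoI `(1, 1, 3, 3)` gives an output of the same 2x2 shape, `[[11, 12], [15, 16]]`.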

Taxonomy

Topics: Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Brain Tumor Detection and Classification

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Softmax · Convolution · RoIPool · Fast R-CNN