Analyzing the Performance of Multilayer Neural Networks for Object   Recognition

Pulkit Agrawal; Ross Girshick; Jitendra Malik

arXiv:1407.1610·cs.CV·September 23, 2014·71 cites

Analyzing the Performance of Multilayer Neural Networks for Object Recognition

Pulkit Agrawal, Ross Girshick, Jitendra Malik

PDF

Open Access

TL;DR

This paper investigates the feature learning process of convolutional neural networks in object recognition, providing insights to help practitioners better understand and apply CNNs in computer vision tasks.

Contribution

It offers an experimental analysis of CNN feature learning, filling gaps in understanding compared to traditional features like SIFT and HOG.

Findings

01

CNN features outperform traditional methods on recognition tasks

02

Insights into how CNNs learn and represent features

03

Guidelines for applying CNNs effectively in vision problems

Abstract

In the last two years, convolutional neural networks (CNNs) have achieved an impressive suite of results on standard recognition datasets and tasks. CNN-based features seem poised to quickly replace engineered representations, such as SIFT and HOG. However, compared to SIFT and HOG, we understand much less about the nature of the features learned by large CNNs. In this paper, we experimentally probe several aspects of CNN feature learning in an attempt to help practitioners gain useful, evidence-backed intuitions about how to apply CNNs to computer vision problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods