A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

Suraj Srinivas; Ravi Kiran Sarvadevabhatla; Konda Reddy Mopuri; Nikita; Prabhu; Srinivas S S Kruthiventi; R. Venkatesh Babu

arXiv:1601.06615·cs.CV·January 26, 2016

A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

Suraj Srinivas, Ravi Kiran Sarvadevabhatla, Konda Reddy Mopuri, Nikita, Prabhu, Srinivas S S Kruthiventi, R. Venkatesh Babu

PDF

TL;DR

This paper provides a comprehensive taxonomy and survey of convolutional neural network architectures tailored for computer vision, highlighting their evolution, variations, and suitability for different tasks to guide practitioners.

Contribution

It offers a structured taxonomy of CNN architectures for computer vision, filling the gap of a dedicated survey and aiding practitioners in selecting appropriate models.

Findings

01

Analyzes the evolution of CNN architectures from AlexNet onwards.

02

Classifies CNN variations based on their design and application.

03

Serves as a practical guide for novice practitioners in deep learning for vision.

Abstract

Traditional architectures for solving computer vision problems and the degree of success they enjoyed have been heavily reliant on hand-crafted features. However, of late, deep learning techniques have offered a compelling alternative -- that of automatically learning problem-specific features. With this new paradigm, every problem in computer vision is now being re-examined from a deep learning perspective. Therefore, it has become important to understand what kind of deep networks are suitable for a given problem. Although general surveys of this fast-moving paradigm (i.e. deep-networks) exist, a survey specific to computer vision is missing. We specifically consider one form of deep networks widely used in computer vision - convolutional neural networks (CNNs). We start with "AlexNet" as our base CNN and then examine the broad variations proposed over time to suit different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.