About Pyramid Structure in Convolutional Neural Networks

Ihsan Ullah; Alfredo Petrosino

arXiv:1608.04064·cs.CV·August 16, 2016

TL;DR

This paper explores how CNNs can use the pyramid structure typical of biological neurons to cut the number of learnable parameters substantially while keeping accuracy competitive on MNIST, Cifar-10, Cifar-100, and ImageNet-12.

Contribution

It introduces a generalized framework for pyramid structures in CNNs that reduces parameters and disk size without sacrificing performance.

Findings

01 More than 80% fewer parameters in Caffe_LENET while results remain competitive.

02 Competitive results with AlexNet using 10-20% less training data and 10-40% fewer parameters.

03 Effectiveness demonstrated on the MNIST, Cifar-10, Cifar-100, and ImageNet-12 datasets.

Abstract

Deep convolutional neural networks (CNNs) have undoubtedly revolutionized various challenging tasks, mainly in computer vision. However, their model design still requires attention to reduce the number of learnable parameters without a meaningful reduction in performance. In this paper we investigate to what extent CNNs may take advantage of the pyramid structure typical of biological neurons. A generalized statement over the convolutional layers, from the input to the fully connected layer, is introduced that helps further in understanding and designing a successful deep network. It reduces ambiguity, the number of parameters, and their size on disk without degrading overall accuracy. Performance is shown on state-of-the-art models for the MNIST, Cifar-10, Cifar-100, and ImageNet-12 datasets. Despite a reduction of more than 80% in parameters for Caffe_LENET, competitive results are obtained. Further, despite…
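To make the parameter argument concrete, the sketch below counts the weights and biases of a LeNet-style stack for two filter orderings. It rests on assumptions not stated on this page: that a "pyramid" layout means the number of filters shrinks with depth, and that the baseline follows the standard Caffe LeNet layout (5x5 valid convolutions with 20 and 50 filters, 2x2 pooling, fully connected layers of 500 and 10 units). The reversed widths (50, 20) are purely illustrative and are not the configuration reported in the paper; `count_params` is a hypothetical helper written for this example.

```python
def count_params(in_channels, in_size, conv_widths, fc_widths, kernel=5, pool=2):
    """Count weights and biases of a LeNet-style stack.

    Each conv layer is a 'valid' kernel x kernel convolution followed by
    non-overlapping pool x pool pooling; the fully connected layers follow.
    """
    total = 0
    channels, size = in_channels, in_size
    for width in conv_widths:
        # One kernel per (input channel, output filter) pair, plus one bias per filter.
        total += width * (channels * kernel * kernel) + width
        # Valid convolution shrinks the map, pooling then divides it.
        size = (size - kernel + 1) // pool
        channels = width
    features = channels * size * size  # flattened input to the first FC layer
    for width in fc_widths:
        total += features * width + width
        features = width
    return total


# Baseline: assumed Caffe LeNet layout on 28x28 grayscale input.
baseline = count_params(1, 28, conv_widths=(20, 50), fc_widths=(500, 10))

# Illustrative pyramid ordering (assumption): filter count decreases with depth.
pyramid = count_params(1, 28, conv_widths=(50, 20), fc_widths=(500, 10))

reduction = 1 - pyramid / baseline
print(f"baseline={baseline}, pyramid={pyramid}, reduction={reduction:.1%}")
```

Most of the saving in this toy comparison comes from the first fully connected layer, whose input size is set by the number of filters in the last convolutional layer. Narrowing that last layer is therefore where a shrinking filter count pays off most in this layout.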

Code & Models

Repositories

Adi-repo/Capstone_Project_2020
