Convolutional Neural Networks In Convolution

Xiaobo Huang

arXiv:1810.03946 · cs.CV · October 10, 2018 · 1 citation

TL;DR

This paper introduces CNN In Convolution (CNNIC), a novel wider CNN architecture that uses small CNNs as convolutional kernels to improve accuracy without data transmutation, demonstrated on MNIST.

Contribution

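The paper proposes a new CNN architecture, CNNIC, that uses small CNNs as convolution kernels in place of the usual generalized linear model (GLM) based filters, aiming for higher accuracy. Dropout and orthonormal initialization are used to address slow convergence and over-fitting during training.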

Findings

01 Achieved high classification accuracy on MNIST.

02 Used dropout and orthonormal initialization to counter slow convergence and over-fitting.

03 Demonstrated the effectiveness of the CNNIC architecture.
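
The summary names dropout and orthonormal initialization but does not spell out how they are applied. The following is a minimal NumPy sketch of how the two techniques are commonly implemented. The function names `orthonormal_init` and `dropout`, the QR-based construction, and the inverted-dropout scaling are assumptions for illustration, not the author's code.

```python
import numpy as np

def orthonormal_init(shape, rng):
    """Weights whose flattened (fan_in, fan_out) matrix has orthonormal
    columns (or rows, when fan_in < fan_out). Assumed QR-based recipe."""
    fan_out = shape[-1]
    fan_in = int(np.prod(shape[:-1]))
    a = rng.standard_normal((max(fan_in, fan_out), min(fan_in, fan_out)))
    q, r = np.linalg.qr(a)            # q has orthonormal columns
    q = q * np.sign(np.diag(r))       # fix column signs for a well-spread draw
    if fan_in < fan_out:
        q = q.T
    return q.reshape(shape)

def dropout(x, rate, rng, training=True):
    """Inverted dropout: zero each unit with probability `rate` during
    training and rescale the survivors so the expected value is unchanged."""
    if not training or rate == 0.0:
        return x
    mask = rng.random(x.shape) >= rate
    return x * mask / (1.0 - rate)

rng = np.random.default_rng(0)
w = orthonormal_init((3, 3, 1, 4), rng)      # hypothetical 3x3 kernel, 1 -> 4 channels
h = dropout(np.ones((6, 6, 4)), rate=0.5, rng=rng)
```

Flattening `w` to a 9 x 4 matrix gives orthonormal columns, which is the property usually credited with speeding up convergence early in training. Dropout is switched off at evaluation time by passing `training=False`.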

Abstract

Increasingly deep neural networks are currently being applied to improve accuracy. In contrast, we propose a novel, wider Convolutional Neural Network (CNN) architecture, motivated by Multi-column Deep Neural Networks and Network In Network (NIN), that aims for higher accuracy without input data transmutation. In our architecture, named "CNN In Convolution" (CNNIC), a small CNN, instead of the original generalized linear model (GLM) based filter, is convolved as a kernel over the original image and serves as the feature-extracting layer of the network. Classification is then carried out by a global average pooling layer and a softmax layer. Dropout and orthonormal initialization are applied to overcome training difficulties, including slow convergence and over-fitting. Persuasive classification performance is demonstrated on MNIST.
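
The pipeline in the abstract (a small CNN slid over the image as a kernel, then global average pooling, then softmax) can be sketched in NumPy as follows. This is an illustration under stated assumptions, not the paper's implementation: the patch size, stride, layer widths, and the names `conv2d_valid`, `small_cnn`, and `cnnic_forward` are all hypothetical, and the weights are random rather than trained.

```python
import numpy as np

def conv2d_valid(x, w):
    """Plain 'valid' cross-correlation. x: (H, W, Cin), w: (k, k, Cin, Cout)."""
    k = w.shape[0]
    H, W, _ = x.shape
    out = np.empty((H - k + 1, W - k + 1, w.shape[3]))
    for i in range(H - k + 1):
        for j in range(W - k + 1):
            out[i, j] = np.tensordot(x[i:i + k, j:j + k, :], w, axes=3)
    return out

def small_cnn(patch, params):
    """Hypothetical kernel network: conv -> ReLU -> conv.
    Maps one image patch to one score per class."""
    h = np.maximum(conv2d_valid(patch, params["w1"]), 0.0)
    h = conv2d_valid(h, params["w2"])     # collapses the patch to 1 x 1 x classes
    return h.reshape(-1)

def cnnic_forward(image, params, patch=8, stride=4):
    """Slide the small CNN over the image like a convolution kernel,
    then apply global average pooling and softmax."""
    H, W, _ = image.shape
    feats = [small_cnn(image[i:i + patch, j:j + patch, :], params)
             for i in range(0, H - patch + 1, stride)
             for j in range(0, W - patch + 1, stride)]
    logits = np.stack(feats).mean(axis=0)  # global average pooling over positions
    e = np.exp(logits - logits.max())      # numerically stable softmax
    return e / e.sum()

rng = np.random.default_rng(0)
params = {
    "w1": rng.standard_normal((3, 3, 1, 4)) * 0.1,   # 8x8 patch -> 6x6x4
    "w2": rng.standard_normal((6, 6, 4, 10)) * 0.1,  # 6x6x4 -> 1x1x10 class scores
}
probs = cnnic_forward(rng.random((28, 28, 1)), params)  # MNIST-sized input
```

Because the same small network is applied at every position, its weights are shared across the image in the same way an ordinary filter's weights are. The only change from a standard convolution layer is that the per-patch function is a CNN rather than a single linear filter.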

Code & Models

Repositories

MyWorkShop/Convolutional-Neural-Networks-in-Convolution (TensorFlow, official)

Taxonomy

Topics: Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning

Methods: Softmax · Dropout