Convolutional Neural Networks In Convolution
Xiaobo Huang

TL;DR
This paper introduces CNN In Convolution (CNNIC), a novel wider CNN architecture that uses small CNNs as convolutional kernels to improve accuracy without data transmutation, demonstrated on MNIST.
Contribution
The paper proposes a new CNN architecture, CNNIC, that employs small CNNs as kernels, enhancing accuracy and training stability over traditional models.
Findings
Achieved high classification accuracy on MNIST.
Utilized dropout and orthonormal initialization for better training.
Demonstrated the effectiveness of CNNIC architecture.
Abstract
Currently, increasingly deeper neural networks have been applied to improve their accuracy. In contrast, We propose a novel wider Convolutional Neural Networks (CNN) architecture, motivated by the Multi-column Deep Neural Networks and the Network In Network(NIN), aiming for higher accuracy without input data transmutation. In our architecture, namely "CNN In Convolution"(CNNIC), a small CNN, instead of the original generalized liner model(GLM) based filters, is convoluted as kernel on the original image, serving as feature extracting layer of this networks. And further classifications are then carried out by a global average pooling layer and a softmax layer. Dropout and orthonormal initialization are applied to overcome training difficulties including slow convergence and over-fitting. Persuasive classification performance is demonstrated on MNIST.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
MethodsSoftmax · Dropout
