Design of Kernels in Convolutional Neural Networks for Image Classification
Zhun Sun, Mete Ozay, Takayuki Okatani

TL;DR
This paper investigates how the shape of convolution kernels in CNNs influences learned features and classification performance, proposing a novel kernel design that improves accuracy and robustness while reducing computational costs.
Contribution
It introduces a new kernel shape design for CNNs based on receptive field analysis, achieving state-of-the-art results and enhanced robustness in image classification tasks.
Findings
Achieved state-of-the-art accuracy on ILSVRC-2012 and CIFAR datasets.
Reduced model parameters and computational time.
Improved robustness to occlusion in image classification.
Abstract
Despite the effectiveness of Convolutional Neural Networks (CNNs) for image classification, our understanding of the relationship between shape of convolution kernels and learned representations is limited. In this work, we explore and employ the relationship between shape of kernels which define Receptive Fields (RFs) in CNNs for learning of feature representations and image classification. For this purpose, we first propose a feature visualization method for visualization of pixel-wise classification score maps of learned features. Motivated by our experimental results, and observations reported in the literature for modeling of visual systems, we propose a novel design of shape of kernels for learning of representations in CNNs. In the experimental results, we achieved a state-of-the-art classification performance compared to a base CNN model [28] by reducing the number of parameters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Cell Image Analysis Techniques · Generative Adversarial Networks and Image Synthesis
