Kernel function impact on convolutional neural networks
M.Amine Mahmoudi, Aladine Chetouani, Fatma Boufera, Hedi Tabia

TL;DR
This paper explores the effects of different kernel functions in CNNs, proposing new methods like distortion-aware pooling and Kernelized Dense Layers to enhance performance and discriminative power, especially in fine-grained classification tasks.
Contribution
It introduces novel kernel-based layers and pooling techniques that improve CNN accuracy and detail sensitivity, outperforming classical methods on multiple datasets.
Findings
Kernel functions impact CNN performance significantly.
Distortion-aware pooling reduces overfitting effectively.
Kernelized Dense Layers enhance discriminative power.
Abstract
This paper investigates the usage of kernel functions at the different layers in a convolutional neural network. We carry out extensive studies of their impact on convolutional, pooling and fully-connected layers. We notice that the linear kernel may not be sufficiently effective to fit the input data distributions, whereas high order kernels prone to over-fitting. This leads to conclude that a trade-off between complexity and performance should be reached. We show how one can effectively leverage kernel functions, by introducing a more distortion aware pooling layers which reduces over-fitting while keeping track of the majority of the information fed into subsequent layers. We further propose Kernelized Dense Layers (KDL), which replace fully-connected layers, and capture higher order feature interactions. The experiments on conventional classification datasets i.e. MNIST,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Human Pose and Action Recognition · Advanced Image and Video Retrieval Techniques
MethodsAttentive Walk-Aggregating Graph Neural Network
