Generalizing the Convolution Operator in Convolutional Neural Networks
Kamaledin Ghiasi-Shirazi

TL;DR
This paper introduces two new classes of generalized convolution operators for CNNs using kernel functions and similarity measures, allowing more flexible feature extraction and demonstrating comparable performance to traditional CNNs on MNIST.
Contribution
It proposes a novel framework for generalizing convolution operators in CNNs through kernel and similarity-based functions, justified by historical and theoretical insights.
Findings
Generalized CNNs achieve performance comparable to standard CNNs on MNIST.
The framework justifies the use of sine activation functions in CNNs.
Distance-based generalized CNNs outperform traditional CNNs in certain configurations.
Abstract
Convolutional neural networks have become a main tool for solving many machine vision and machine learning problems. A major element of these networks is the convolution operator which essentially computes the inner product between a weight vector and the vectorized image patches extracted by sliding a window in the image planes of the previous layer. In this paper, we propose two classes of surrogate functions for the inner product operation inherent in the convolution operator and so attain two generalizations of the convolution operator. The first one is the class of positive definite kernel functions where their application is justified by the kernel trick. The second one is the class of similarity measures defined based on a distance function. We justify this by tracing back to the basic idea behind the neocognitron which is the ancestor of CNNs. Both methods are then further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
