On filter design in deep convolutional neural network
Gaurav Hirani, Waleed Abdulla

TL;DR
This paper investigates the impact of filter design choices, including initialization and size, on the learning and optimization of deep convolutional neural networks, highlighting the need for mathematical understanding of these hyper-parameters.
Contribution
It provides a comprehensive analysis of filter parameters in DCNNs, which are often treated as hyper-parameters, and discusses their effects on learning and optimization.
Findings
Analyzes the effects of filter size and initialization on learning.
Evaluates unsupervised approaches for filter design.
Discusses limitations and future challenges in filter optimization.
Abstract
The deep convolutional neural network (DCNN) in computer vision has given promising results. It is widely applied in many areas, from medicine, agriculture, self-driving car, biometric system, and almost all computer vision-based applications. Filters or weights are the critical elements responsible for learning in DCNN. Backpropagation has been the primary learning algorithm for DCNN and provides promising results, but the size and numbers of the filters remain hyper-parameters. Various studies have been done in the last decade on semi-supervised, self-supervised, and unsupervised methods and their properties. The effects of filter initialization, size-shape selection, and the number of filters on learning and optimization have not been investigated in a separate publication to collate all the options. Such attributes are often treated as hyper-parameters and lack mathematical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Speech and Audio Processing
MethodsDiffusion-Convolutional Neural Networks
