On filter design in deep convolutional neural network

Gaurav Hirani; Waleed Abdulla

arXiv:2410.21644·cs.CV·November 1, 2024

On filter design in deep convolutional neural network

Gaurav Hirani, Waleed Abdulla

PDF

Open Access

TL;DR

This paper investigates the impact of filter design choices, including initialization and size, on the learning and optimization of deep convolutional neural networks, highlighting the need for mathematical understanding of these hyper-parameters.

Contribution

It provides a comprehensive analysis of filter parameters in DCNNs, which are often treated as hyper-parameters, and discusses their effects on learning and optimization.

Findings

01

Analyzes the effects of filter size and initialization on learning.

02

Evaluates unsupervised approaches for filter design.

03

Discusses limitations and future challenges in filter optimization.

Abstract

The deep convolutional neural network (DCNN) in computer vision has given promising results. It is widely applied in many areas, from medicine, agriculture, self-driving car, biometric system, and almost all computer vision-based applications. Filters or weights are the critical elements responsible for learning in DCNN. Backpropagation has been the primary learning algorithm for DCNN and provides promising results, but the size and numbers of the filters remain hyper-parameters. Various studies have been done in the last decade on semi-supervised, self-supervised, and unsupervised methods and their properties. The effects of filter initialization, size-shape selection, and the number of filters on learning and optimization have not been investigated in a separate publication to collate all the options. Such attributes are often treated as hyper-parameters and lack mathematical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Speech and Audio Processing

MethodsDiffusion-Convolutional Neural Networks