Adaptive Convolution Kernel for Artificial Neural Networks

F. Boray Tek; \.Ilker \c{C}am; Deniz Karl{\i}

arXiv:2009.06385·cs.CV·September 15, 2020

Adaptive Convolution Kernel for Artificial Neural Networks

F. Boray Tek, \.Ilker \c{C}am, Deniz Karl{\i}

PDF

1 Repo

TL;DR

This paper introduces a differentiable method for training convolutional kernel sizes within neural networks, enabling adaptive kernel growth or shrinkage to improve performance across various image tasks.

Contribution

It presents a novel Gaussian envelope-based approach for adaptive convolution kernels that can be trained via backpropagation, enhancing neural network flexibility and accuracy.

Findings

01

Adaptive kernels outperform fixed-size kernels in image classification tasks.

02

Replacing standard convolution with adaptive layers improves segmentation performance.

03

Statistically significant gains observed across multiple datasets.

Abstract

Many deep neural networks are built by using stacked convolutional layers of fixed and single size (often 3 $\times$ 3) kernels. This paper describes a method for training the size of convolutional kernels to provide varying size kernels in a single layer. The method utilizes a differentiable, and therefore backpropagation-trainable Gaussian envelope which can grow or shrink in a base grid. Our experiments compared the proposed adaptive layers to ordinary convolution layers in a simple two-layer network, a deeper residual network, and a U-Net architecture. The results in the popular image classification datasets such as MNIST, MNIST-CLUTTERED, CIFAR-10, Fashion, and ``Faces in the Wild'' showed that the adaptive kernels can provide statistically significant improvements on ordinary convolution kernels. A segmentation experiment in the Oxford-Pets dataset demonstrated that replacing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

btekgit/AdaptiveCNN
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConcatenated Skip Connection · Max Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · U-Net