Multi-objective Evolutionary Approach for Efficient Kernel Size and Shape for CNN
Ziwei Wang, Martin A. Trefzer, Simon J. Bale, Andy M. Tyrrell

TL;DR
This paper presents a multi-objective evolutionary approach to optimize CNN kernel size and shape, significantly reducing computational costs while maintaining accuracy, especially for resource-constrained applications like IoT.
Contribution
It introduces a novel methodology using MOEAs to optimize kernel size and shape, including unconventional shapes, for efficient CNN architectures with minimal accuracy loss.
Findings
Up to 6X reduction in multiplications compared to benchmark CNNs
Unconventional kernel shapes outperform standard square kernels
Maintains similar classification accuracy on CIFAR-10
Abstract
While state-of-the-art development in CNN topology, such as VGGNet and ResNet, have become increasingly accurate, these networks are computationally expensive involving billions of arithmetic operations and parameters. To improve the classification accuracy, state-of-the-art CNNs usually involve large and complex convolutional layers. However, for certain applications, e.g. Internet of Things (IoT), where such CNNs are to be implemented on resource-constrained platforms, the CNN architectures have to be small and efficient. To deal with this problem, reducing the resource consumption in convolutional layers has become one of the most significant solutions. In this work, a multi-objective optimisation approach is proposed to trade-off between the amount of computation and network accuracy by using Multi-Objective Evolutionary Algorithms (MOEAs). The number of convolution kernels and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Video Surveillance and Tracking Methods
MethodsBatch Normalization · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Global Average Pooling · 1x1 Convolution · Bottleneck Residual Block · Residual Block · Kaiming Initialization · Max Pooling
