TL;DR
This paper introduces Multiception, a novel depthwise multiscale convolution method that reduces network parameters by over 32% without sacrificing accuracy, outperforming existing approaches across multiple datasets.
Contribution
The paper proposes Multiception, a new convolution technique with multiscale kernels that maintains accuracy while significantly reducing parameters in CNNs.
Findings
Achieved over 32% parameter reduction on average.
Improved accuracy across all tested models and datasets.
Outperformed related methods in efficiency and performance.
Abstract
Deep convolutional neural networks have been proven successful in multiple benchmark challenges in recent years. However, the performance improvements are heavily reliant on increasingly complex network architecture and a high number of parameters, which require ever increasing amounts of storage and memory capacity. Depthwise separable convolution (DSConv) can effectively reduce the number of required parameters through decoupling standard convolution into spatial and cross-channel convolution steps. However, the method causes a degradation of accuracy. To address this problem, we present depthwise multiception convolution, termed Multiception, which introduces layer-wise multiscale kernels to learn multiscale representations of all individual input channels simultaneously. We have carried out the experiment on four benchmark datasets, i.e. Cifar-10, Cifar-100, STL-10 and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPointwise Convolution · Depthwise Convolution · Convolution · Depthwise Separable Convolution
