Penetrating the Fog: the Path to Efficient CNN Models
Kun Wan, Boyuan Feng, Shu Yang, Yufei Ding

TL;DR
This paper introduces a novel approach to designing effective sparse kernels for CNNs, significantly reducing the design space and improving parameter efficiency without sacrificing accuracy, thus enabling more efficient models for mobile deployment.
Contribution
It presents a systematic scheme to craft sparse kernel designs by eliminating ineffective options based on composition, accuracy impact, and efficiency, which was not addressed in prior work.
Findings
The scheme identifies more parameter-efficient sparse kernel designs.
Experimental results show comparable or higher accuracy with fewer parameters.
The approach reduces the design space, facilitating efficient CNN model development.
Abstract
With the increasing demand to deploy convolutional neural networks (CNNs) on mobile platforms, the sparse kernel approach was proposed, which could save more parameters than the standard convolution while maintaining accuracy. However, despite the great potential, no prior research has pointed out how to craft an sparse kernel design with such potential (i.e., effective design), and all prior works just adopt simple combinations of existing sparse kernels such as group convolution. Meanwhile due to the large design space it is also impossible to try all combinations of existing sparse kernels. In this paper, we are the first in the field to consider how to craft an effective sparse kernel design by eliminating the large design space. Specifically, we present a sparse kernel scheme to illustrate how to reduce the space from three aspects. First, in terms of composition we remove designs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Video Surveillance and Tracking Methods
MethodsConvolution
