CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks
Weicheng Li, Rui Wang, Zhongzhi Luan, Di Huang, Zidong Du, Yunji Chen, and Depei Qian

TL;DR
CompactNet is an automated platform-aware optimization tool that trims pre-trained CNNs to meet specific speed and resource constraints while preserving accuracy, demonstrated on mobile and NPU platforms.
Contribution
It introduces a novel method for automatic, platform-specific CNN model optimization guided by a simulator, enabling efficient deployment on resource-limited devices.
Findings
Achieves up to 1.8x speedup on MobileNetV2
Maintains or improves accuracy after optimization
Effective on ARM CPU and NPU platforms
Abstract
Convolutional Neural Network (CNN) based Deep Learning (DL) has achieved great progress in many real-life applications. Meanwhile, due to the complex model structures against strict latency and memory restriction, the implementation of CNN models on the resource-limited platforms is becoming more challenging. This work proposes a solution, called CompactNet\footnote{Project URL: \url{https://github.com/CompactNet/CompactNet}}, which automatically optimizes a pre-trained CNN model on a specific resource-limited platform given a specific target of inference speedup. Guided by a simulator of the target platform, CompactNet progressively trims a pre-trained network by removing certain redundant filters until the target speedup is reached and generates an optimal platform-specific model while maintaining the accuracy. We evaluate our work on two platforms of a mobile ARM CPU and a machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsDepthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · 1x1 Convolution · Batch Normalization · Inverted Residual Block · Convolution · Average Pooling · Tether Customer Service Number +1-833-534-1729
