CompactNet: Platform-Aware Automatic Optimization for Convolutional   Neural Networks

Weicheng Li; Rui Wang; Zhongzhi Luan; Di Huang; Zidong Du; Yunji Chen; and Depei Qian

arXiv:1905.11669·cs.LG·May 29, 2019·1 cites

CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks

Weicheng Li, Rui Wang, Zhongzhi Luan, Di Huang, Zidong Du, Yunji Chen, and Depei Qian

PDF

Open Access 1 Repo

TL;DR

CompactNet is an automated platform-aware optimization tool that trims pre-trained CNNs to meet specific speed and resource constraints while preserving accuracy, demonstrated on mobile and NPU platforms.

Contribution

It introduces a novel method for automatic, platform-specific CNN model optimization guided by a simulator, enabling efficient deployment on resource-limited devices.

Findings

01

Achieves up to 1.8x speedup on MobileNetV2

02

Maintains or improves accuracy after optimization

03

Effective on ARM CPU and NPU platforms

Abstract

Convolutional Neural Network (CNN) based Deep Learning (DL) has achieved great progress in many real-life applications. Meanwhile, due to the complex model structures against strict latency and memory restriction, the implementation of CNN models on the resource-limited platforms is becoming more challenging. This work proposes a solution, called CompactNet\footnote{Project URL: \url{https://github.com/CompactNet/CompactNet}}, which automatically optimizes a pre-trained CNN model on a specific resource-limited platform given a specific target of inference speedup. Guided by a simulator of the target platform, CompactNet progressively trims a pre-trained network by removing certain redundant filters until the target speedup is reached and generates an optimal platform-specific model while maintaining the accuracy. We evaluate our work on two platforms of a mobile ARM CPU and a machine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CompactNet/CompactNet
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsDepthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · 1x1 Convolution · Batch Normalization · Inverted Residual Block · Convolution · Average Pooling · Tether Customer Service Number +1-833-534-1729