ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, Jian Sun

TL;DR
ShuffleNet is a highly efficient CNN architecture optimized for mobile devices, employing novel operations to reduce computation while maintaining high accuracy, and demonstrating significant speedups over traditional models.
Contribution
Introduces ShuffleNet, a new CNN architecture with pointwise group convolution and channel shuffle, achieving high efficiency and accuracy for mobile applications.
Findings
Outperforms MobileNet with 7.8% lower top-1 error on ImageNet at 40 MFLOPs
Achieves ~13x speedup over AlexNet on ARM devices
Maintains comparable accuracy with significantly reduced computation
Abstract
We introduce an extremely computation-efficient CNN architecture named ShuffleNet, which is designed specially for mobile devices with very limited computing power (e.g., 10-150 MFLOPs). The new architecture utilizes two new operations, pointwise group convolution and channel shuffle, to greatly reduce computation cost while maintaining accuracy. Experiments on ImageNet classification and MS COCO object detection demonstrate the superior performance of ShuffleNet over other structures, e.g. lower top-1 error (absolute 7.8%) than recent MobileNet on ImageNet classification task, under the computation budget of 40 MFLOPs. On an ARM-based mobile device, ShuffleNet achieves ~13x actual speedup over AlexNet while maintaining comparable accuracy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗onnxmodelzoo/shufflenet-3model
- 🤗onnxmodelzoo/shufflenet-6model
- 🤗onnxmodelzoo/shufflenet-7model
- 🤗onnxmodelzoo/shufflenet-8model
- 🤗onnxmodelzoo/shufflenet-9model
- 🤗onnxmodelzoo/shufflenet-v2-10model
- 🤗onnxmodelzoo/shufflenet-v2-12-int8model
- 🤗onnxmodelzoo/shufflenet-v2-12-qdqmodel
- 🤗onnxmodelzoo/shufflenet-v2-12model
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
Methods1x1 Convolution · Residual Connection · Region Proposal Network · Average Pooling · Local Response Normalization · Global Average Pooling · Grouped Convolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout
