Slimmable Neural Networks
Jiahui Yu, Linjie Yang, Ning Xu, Jianchao Yang, Thomas Huang

TL;DR
This paper introduces slimmable neural networks that can dynamically adjust their width at runtime, enabling flexible accuracy-efficiency trade-offs without multiple models.
Contribution
It proposes a simple method to train a single neural network with switchable batch normalization for adaptive width adjustment during inference.
Findings
Achieves comparable or better accuracy than individually trained models.
Performs well across various tasks like detection and segmentation.
Enables instant adaptation to resource constraints at runtime.
Abstract
We present a simple and general method to train a single neural network executable at different widths (number of channels in a layer), permitting instant and adaptive accuracy-efficiency trade-offs at runtime. Instead of training individual networks with different width configurations, we train a shared network with switchable batch normalization. At runtime, the network can adjust its width on the fly according to on-device benchmarks and resource constraints, rather than downloading and offloading different models. Our trained networks, named slimmable neural networks, achieve similar (and in many cases better) ImageNet classification accuracy than individually trained models of MobileNet v1, MobileNet v2, ShuffleNet and ResNet-50 at different widths respectively. We also demonstrate better performance of slimmable models compared with individual ones across a wide range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
MethodsInverted Residual Block · Tether Customer Service Number +1-833-534-1729 · Depthwise Separable Convolution · MobileNetV1 · Bottleneck Residual Block · Residual Block · Kaiming Initialization · Bitcoin Customer Service Number +1-833-534-1729 · 1x1 Convolution · Grouped Convolution
