FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
Bichen Wu, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming, Wu, Yuandong Tian, Peter Vajda, Yangqing Jia, Kurt Keutzer

TL;DR
This paper introduces a differentiable neural architecture search framework for designing efficient ConvNets tailored for mobile devices, achieving state-of-the-art accuracy and latency with significantly reduced search costs.
Contribution
The authors propose a gradient-based NAS method that efficiently discovers hardware-aware ConvNet architectures, outperforming prior methods in accuracy, speed, and search efficiency.
Findings
FBNet-B achieves 74.1% top-1 accuracy on ImageNet.
FBNet-B is 2.4x smaller and 1.5x faster than MobileNetV2-1.3.
Search cost is 420x smaller than MnasNet, at 216 GPU-hours.
Abstract
Designing accurate and efficient ConvNets for mobile devices is challenging because the design space is combinatorially large. Due to this, previous neural architecture search (NAS) methods are computationally expensive. ConvNet architecture optimality depends on factors such as input resolution and target devices. However, existing approaches are too expensive for case-by-case redesigns. Also, previous work focuses primarily on reducing FLOPs, but FLOP count does not always reflect actual latency. To address these, we propose a differentiable neural architecture search (DNAS) framework that uses gradient-based methods to optimize ConvNet architectures, avoiding enumerating and training individual architectures separately as in previous methods. FBNets, a family of models discovered by DNAS surpass state-of-the-art models both designed manually and generated automatically. FBNet-B…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsDepthwise Separable Convolution · Batch Normalization · Inverted Residual Block · Residual Connection · Convolution · Residual Block · Average Pooling · Gumbel Softmax · Global Average Pooling · Grouped Convolution
