TL;DR
MnasNet introduces a platform-aware neural architecture search method that directly optimizes for accuracy and real-world latency on mobile devices, resulting in highly efficient CNN models.
Contribution
The paper presents a novel NAS approach that incorporates actual latency measurements and a hierarchical search space for better mobile CNN design.
Findings
Outperforms state-of-the-art mobile CNNs in accuracy and speed
Achieves 75.2% top-1 accuracy with 78ms latency on ImageNet
Provides improved object detection performance over MobileNets
Abstract
Designing convolutional neural networks (CNN) for mobile devices is challenging because mobile models need to be small and fast, yet still accurate. Although significant efforts have been dedicated to design and improve mobile CNNs on all dimensions, it is very difficult to manually balance these trade-offs when there are so many architectural possibilities to consider. In this paper, we propose an automated mobile neural architecture search (MNAS) approach, which explicitly incorporate model latency into the main objective so that the search can identify a model that achieves a good trade-off between accuracy and latency. Unlike previous work, where latency is considered via another, often inaccurate proxy (e.g., FLOPS), our approach directly measures real-world inference latency by executing the model on mobile phones. To further strike the right balance between flexibility and search…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗timm/mnasnet_100.rmsp_in1kmodel· 7.5k dl7.5k dl
- 🤗timm/mnasnet_small.lamb_in1kmodel· 775 dl775 dl
- 🤗timm/semnasnet_075.rmsp_in1kmodel· 236 dl236 dl
- 🤗timm/semnasnet_100.rmsp_in1kmodel· 82 dl82 dl
- 🤗kadirnar/timm_model_listmodel· ♡ 1♡ 1
- 🤗amd/mnasnet_b1model· ♡ 1♡ 1
- 🤗qualcomm/MNASNet05model· 156 dl156 dl
- 🤗Kalray/nasnetmodel
- 🤗STMicroelectronics/mnasnet_ptmodel
- 🤗STMicroelectronics/semnasnet_ptmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Depthwise Convolution · Pointwise Convolution · Random Resized Crop · Random Horizontal Flip · Weight Decay · Linear Warmup With Linear Decay
