Accelerator-aware Neural Network Design using AutoML
Suyog Gupta, Berkin Akin

TL;DR
This paper introduces hardware-aware neural architecture search tailored for the Edge TPU, resulting in models that deliver real-time image classification with high accuracy on low-power edge devices.
Contribution
The paper presents an AutoML-based approach, hardware-aware neural architecture search, for designing neural networks tailored to the Edge TPU, with the goal of improving the accuracy-latency tradeoff on that hardware.
Findings
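On the Edge TPU in Coral devices, the models achieve real-time image classification.
Their accuracy is in the range typically seen only with larger, compute-heavy models running in data centers.
On Pixel 4's Edge TPU, the models improve the accuracy-latency tradeoff over existing state-of-the-art mobile models.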
Abstract
While neural network hardware accelerators provide a substantial amount of raw compute throughput, the models deployed on them must be co-designed for the underlying hardware architecture to obtain the optimal system performance. We present a class of computer vision models designed using hardware-aware neural architecture search and customized to run on the Edge TPU, Google's neural network hardware accelerator for low-power, edge devices. For the Edge TPU in Coral devices, these models enable real-time image classification performance while achieving accuracy typically seen only with larger, compute-heavy models running in data centers. On Pixel 4's Edge TPU, these models improve the accuracy-latency tradeoff over existing SoTA mobile models.
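The "accuracy-latency tradeoff" mentioned above can be made concrete with a small sketch. The Python below is not the paper's search procedure; it only illustrates, under stated assumptions, how a set of candidate models might be compared once latency has been measured on the target accelerator. The `Candidate` type, the helper names, and all numbers are hypothetical and invented for illustration.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    name: str          # hypothetical model label
    latency_ms: float  # latency on the target accelerator (invented values)
    accuracy: float    # top-1 accuracy in [0, 1] (invented values)


def pareto_front(candidates):
    """Return candidates not dominated in (lower latency, higher accuracy)."""
    front = []
    best_accuracy = float("-inf")
    # Walk candidates from fastest to slowest; on equal latency, the more
    # accurate one comes first so the less accurate one is skipped.
    for c in sorted(candidates, key=lambda c: (c.latency_ms, -c.accuracy)):
        if c.accuracy > best_accuracy:
            front.append(c)
            best_accuracy = c.accuracy
    return front


def best_under_budget(candidates, budget_ms):
    """Most accurate candidate whose latency fits the budget, or None."""
    feasible = [c for c in candidates if c.latency_ms <= budget_ms]
    return max(feasible, key=lambda c: c.accuracy, default=None)


# Illustrative, invented numbers; these are not results from the paper.
CANDIDATES = [
    Candidate("model_a", 2.0, 0.70),
    Candidate("model_b", 3.0, 0.68),  # slower and less accurate than model_a
    Candidate("model_c", 4.0, 0.75),
    Candidate("model_d", 6.0, 0.78),
    Candidate("model_e", 6.5, 0.77),  # slower and less accurate than model_d
]

front = pareto_front(CANDIDATES)
choice = best_under_budget(CANDIDATES, budget_ms=5.0)
```

In this sketch, a family of models improves the tradeoff over another when its points lie on the combined front and push the other family's points off it. A latency budget such as a real-time frame deadline then selects a single point from that front.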
Code & Models
- 🤗 timm/efficientnet_el.ra_in1k · model · 252 downloads
- 🤗 timm/efficientnet_el_pruned.in1k · model · 139 downloads
- 🤗 timm/efficientnet_em.ra2_in1k · model · 825 downloads
- 🤗 timm/efficientnet_es.ra_in1k · model · 185 downloads
- 🤗 timm/efficientnet_es_pruned.in1k · model · 155 downloads
- 🤗 timm/tf_efficientnet_el.in1k · model · 71 downloads
- 🤗 timm/tf_efficientnet_em.in1k · model · 72 downloads
- 🤗 timm/tf_efficientnet_es.in1k · model · 479 downloads
Taxonomy
Topics: CCD and CMOS Imaging Sensors · Advanced Neural Network Applications · Advanced Memory and Neural Computing
Methods: Correlation Alignment for Deep Domain Adaptation · Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
