On Latency Predictors for Neural Architecture Search
Yash Akhauri, Mohamed S. Abdelfattah

TL;DR
This paper develops a comprehensive and robust latency prediction framework for neural architecture search, significantly improving prediction accuracy and enabling faster hardware-aware NAS by studying predictor design choices and transfer learning strategies.
Contribution
It introduces a systematic suite of latency prediction tasks and a general predictor architecture, leading to an end-to-end training strategy that outperforms existing methods on multiple benchmarks.
Findings
Improves latency prediction accuracy by 22.5% on average.
Achieves up to 87.6% improvement on hardest tasks.
Enables 5.8x faster hardware-aware NAS.
Abstract
Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN architectures that satisfy a latency constraint on a specific hardware device. Central to these search algorithms is a prediction model that is designed to provide a hardware latency estimate for a candidate NN architecture. Recent research has shown that the sample efficiency of these predictive models can be greatly improved through pre-training on some \textit{training} devices with many samples, and then transferring the predictor on the \textit{test} (target) device. Transfer learning and meta-learning methods have been used for this, but often exhibit significant performance variability. Additionally, the evaluation of existing latency predictors has been largely done on hand-crafted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Image and Video Retrieval Techniques
