On Latency Predictors for Neural Architecture Search

Yash Akhauri; Mohamed S. Abdelfattah

arXiv:2403.02446·cs.LG·March 6, 2024·1 cites

On Latency Predictors for Neural Architecture Search

Yash Akhauri, Mohamed S. Abdelfattah

PDF

Open Access 1 Repo

TL;DR

This paper develops a comprehensive and robust latency prediction framework for neural architecture search, significantly improving prediction accuracy and enabling faster hardware-aware NAS by studying predictor design choices and transfer learning strategies.

Contribution

It introduces a systematic suite of latency prediction tasks and a general predictor architecture, leading to an end-to-end training strategy that outperforms existing methods on multiple benchmarks.

Findings

01

Improves latency prediction accuracy by 22.5% on average.

02

Achieves up to 87.6% improvement on hardest tasks.

03

Enables 5.8x faster hardware-aware NAS.

Abstract

Efficient deployment of neural networks (NN) requires the co-optimization of accuracy and latency. For example, hardware-aware neural architecture search has been used to automatically find NN architectures that satisfy a latency constraint on a specific hardware device. Central to these search algorithms is a prediction model that is designed to provide a hardware latency estimate for a candidate NN architecture. Recent research has shown that the sample efficiency of these predictive models can be greatly improved through pre-training on some \textit{training} devices with many samples, and then transferring the predictor on the \textit{test} (target) device. Transfer learning and meta-learning methods have been used for this, but often exhibit significant performance variability. Additionally, the evaluation of existing latency predictors has been largely done on hand-crafted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

abdelfattah-lab/nasflat_latency
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Image and Video Retrieval Techniques