MAPLE-X: Latency Prediction with Explicit Microprocessor Prior Knowledge

Saad Abbasi; Alexander Wong; Mohammad Javad Shafiee

arXiv:2205.12660·cs.LG·May 26, 2022·1 cites

MAPLE-X: Latency Prediction with Explicit Microprocessor Prior Knowledge

Saad Abbasi, Alexander Wong, Mohammad Javad Shafiee

PDF

Open Access

TL;DR

MAPLE-X enhances neural network latency prediction by incorporating explicit hardware and architecture priors, leading to more accurate and robust estimations across diverse embedded devices, thus improving neural architecture search efficiency.

Contribution

It introduces MAPLE-X, which extends MAPLE by integrating explicit hardware and architecture priors, improving latency prediction accuracy and robustness in diverse hardware environments.

Findings

01

Achieves 5% better accuracy than MAPLE in latency prediction.

02

Demonstrates 9% improvement over HELP in experimental benchmarks.

03

Uses virtual examples and hardware similarity to enhance model robustness.

Abstract

Deep neural network (DNN) latency characterization is a time-consuming process and adds significant cost to Neural Architecture Search (NAS) processes when searching for efficient convolutional neural networks for embedded vision applications. DNN Latency is a hardware dependent metric and requires direct measurement or inference on target hardware. A recently introduced latency estimation technique known as MAPLE predicts DNN execution time on previously unseen hardware devices by using hardware performance counters. Leveraging these hardware counters in the form of an implicit prior, MAPLE achieves state-of-the-art performance in latency prediction. Here, we propose MAPLE-X which extends MAPLE by incorporating explicit prior knowledge of hardware devices and DNN architecture latency to better account for model stability and robustness. First, by identifying DNN architectures that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Memory and Neural Computing · CCD and CMOS Imaging Sensors

MethodsConvolution