HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time   and Robustness

Dennis Rieber; Moritz Reiber; Oliver Bringmann; Holger; Fr\"oning

arXiv:2205.15568·cs.LG·June 1, 2022·1 cites

HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time and Robustness

Dennis Rieber, Moritz Reiber, Oliver Bringmann, Holger, Fr\"oning

PDF

Open Access

TL;DR

This paper introduces a hardware-aware initialization method for DNN auto-tuning that reduces measurement costs and enhances robustness by considering configuration validity, specifically applied to VTA hardware.

Contribution

It proposes a validity-driven initialization approach for AutoTVM that decreases hardware measurements and improves search robustness in DNN auto-tuning.

Findings

01

Reduces hardware measurements to 41.6% of original

02

Improves robustness of auto-tuning process

03

Effectively handles invalid configurations on hardware accelerators

Abstract

The process of optimizing the latency of DNN operators with ML models and hardware-in-the-loop, called auto-tuning, has established itself as a pervasive method for the deployment of neural networks. From a search space of loop-optimizations, the candidate providing the best performance has to be selected. Performance of individual configurations is evaluated through hardware measurements. The combinatorial explosion of possible configurations, together with the cost of hardware evaluation makes exhaustive explorations of the search space infeasible in practice. Machine Learning methods, like random forests or reinforcement learning are used to aid in the selection of candidates for hardware evaluation. For general purpose hardware like x86 and GPGPU architectures impressive performance gains can be achieved, compared to hand-optimized libraries like cuDNN. The method is also useful in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Advanced Neural Network Applications