HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time and Robustness
Dennis Rieber, Moritz Reiber, Oliver Bringmann, Holger, Fr\"oning

TL;DR
This paper introduces a hardware-aware initialization method for DNN auto-tuning that reduces measurement costs and enhances robustness by considering configuration validity, specifically applied to VTA hardware.
Contribution
It proposes a validity-driven initialization approach for AutoTVM that decreases hardware measurements and improves search robustness in DNN auto-tuning.
Findings
Reduces hardware measurements to 41.6% of original
Improves robustness of auto-tuning process
Effectively handles invalid configurations on hardware accelerators
Abstract
The process of optimizing the latency of DNN operators with ML models and hardware-in-the-loop, called auto-tuning, has established itself as a pervasive method for the deployment of neural networks. From a search space of loop-optimizations, the candidate providing the best performance has to be selected. Performance of individual configurations is evaluated through hardware measurements. The combinatorial explosion of possible configurations, together with the cost of hardware evaluation makes exhaustive explorations of the search space infeasible in practice. Machine Learning methods, like random forests or reinforcement learning are used to aid in the selection of candidates for hardware evaluation. For general purpose hardware like x86 and GPGPU architectures impressive performance gains can be achieved, compared to hand-optimized libraries like cuDNN. The method is also useful in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Advanced Neural Network Applications
