FLASH: Fast Neural Architecture Search with Hardware Optimization
Guihong Li, Sumit K. Mandal, Umit Y. Ogras, Radu Marculescu

TL;DR
FLASH introduces a rapid neural architecture search method that co-optimizes accuracy and hardware performance, utilizing an analytical metric and hierarchical optimization to achieve results in seconds on various hardware platforms.
Contribution
The paper presents a novel, training-free NAS approach with an analytical metric and hierarchical optimization, significantly reducing search time and enabling hardware-aware DNN design.
Findings
Achieves over 10,000x speedup compared to existing NAS methods.
Can perform NAS on a Raspberry Pi-3B in less than 3 seconds.
Accurately predicts latency, area, and energy consumption of DNNs on target hardware.
Abstract
Neural architecture search (NAS) is a promising technique to design efficient and high-performance deep neural networks (DNNs). As the performance requirements of ML applications grow continuously, the hardware accelerators start playing a central role in DNN design. This trend makes NAS even more complicated and time-consuming for most real applications. This paper proposes FLASH, a very fast NAS methodology that co-optimizes the DNN accuracy and performance on a real hardware platform. As the main theoretical contribution, we first propose the NN-Degree, an analytical metric to quantify the topological characteristics of DNNs with skip connections (e.g., DenseNets, ResNets, Wide-ResNets, and MobileNets). The newly proposed NN-Degree allows us to do training-free NAS within one second and build an accuracy predictor by training as few as 25 samples out of a vast search space with more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
