TL;DR
NASA introduces an automated framework combining neural architecture search and hardware acceleration to develop efficient, multiplication-reduced deep neural networks suitable for resource-constrained environments.
Contribution
The paper presents NASA, a novel NAS framework with hardware-inspired operators and a dedicated accelerator, enabling automated design of multiplication-free DNNs with improved efficiency.
Findings
NASA achieves competitive accuracy with multiplication-reduced DNNs.
The co-designed hardware accelerates the execution of NASA-designed networks.
Experimental results validate the efficiency and effectiveness of NASA's approach.
Abstract
Multiplication is arguably the most cost-dominant operation in modern deep neural networks (DNNs), limiting their achievable efficiency and thus more extensive deployment in resource-constrained applications. To tackle this limitation, pioneering works have developed handcrafted multiplication-free DNNs, which require expert knowledge and time-consuming manual iteration, calling for fast development tools. To this end, we propose a Neural Architecture Search and Acceleration framework dubbed NASA, which enables automated multiplication-reduced DNN development and integrates a dedicated multiplication-reduced accelerator for boosting DNNs' achievable efficiency. Specifically, NASA adopts neural architecture search (NAS) spaces that augment the state-of-the-art one with hardware-inspired multiplication-free operators, such as shift and adder, armed with a novel progressive pretrain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
