Does Form Follow Function? An Empirical Exploration of the Impact of Deep Neural Network Architecture Design on Hardware-Specific Acceleration
Saad Abbasi, Mohammad Javad Shafiee, Ellick Chan, and Alexander Wong

TL;DR
This study empirically investigates how different deep neural network architecture designs influence hardware-specific acceleration, revealing significant variability in speedup gains across patterns and architectures, with some designs achieving over 1200% improvement.
Contribution
It provides a comprehensive empirical analysis of the relationship between neural network architecture design and hardware acceleration performance, highlighting design patterns that maximize speedup.
Findings
Hardware-specific acceleration achieved an average 380% speedup.
Depthwise bottleneck convolution pattern achieved 550% speedup.
DARTS-derived architecture benefited from 1200% acceleration.
Abstract
The fine-grained relationship between form and function with respect to deep neural network architecture design and hardware-specific acceleration is one area that is not well studied in the research literature, with form often dictated by accuracy as opposed to hardware function. In this study, a comprehensive empirical exploration is conducted to investigate the impact of deep neural network architecture design on the degree of inference speedup that can be achieved via hardware-specific acceleration. More specifically, we empirically study the impact of a variety of commonly used macro-architecture design patterns across different architectural depths through the lens of OpenVINO microprocessor-specific and GPU-specific acceleration. Experimental results showed that while leveraging hardware-specific acceleration achieved an average inference speed-up of 380%, the degree of inference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Advanced Neural Network Applications
MethodsConvolution
