Does Form Follow Function? An Empirical Exploration of the Impact of   Deep Neural Network Architecture Design on Hardware-Specific Acceleration

Saad Abbasi; Mohammad Javad Shafiee; Ellick Chan; and Alexander Wong

arXiv:2107.04144·cs.LG·July 12, 2021

Does Form Follow Function? An Empirical Exploration of the Impact of Deep Neural Network Architecture Design on Hardware-Specific Acceleration

Saad Abbasi, Mohammad Javad Shafiee, Ellick Chan, and Alexander Wong

PDF

Open Access

TL;DR

This study empirically investigates how different deep neural network architecture designs influence hardware-specific acceleration, revealing significant variability in speedup gains across patterns and architectures, with some designs achieving over 1200% improvement.

Contribution

It provides a comprehensive empirical analysis of the relationship between neural network architecture design and hardware acceleration performance, highlighting design patterns that maximize speedup.

Findings

01

Hardware-specific acceleration achieved an average 380% speedup.

02

Depthwise bottleneck convolution pattern achieved 550% speedup.

03

DARTS-derived architecture benefited from 1200% acceleration.

Abstract

The fine-grained relationship between form and function with respect to deep neural network architecture design and hardware-specific acceleration is one area that is not well studied in the research literature, with form often dictated by accuracy as opposed to hardware function. In this study, a comprehensive empirical exploration is conducted to investigate the impact of deep neural network architecture design on the degree of inference speedup that can be achieved via hardware-specific acceleration. More specifically, we empirically study the impact of a variety of commonly used macro-architecture design patterns across different architectural depths through the lens of OpenVINO microprocessor-specific and GPU-specific acceleration. Experimental results showed that while leveraging hardware-specific acceleration achieved an average inference speed-up of 380%, the degree of inference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Advanced Neural Network Applications

MethodsConvolution