A Formalism of DNN Accelerator Flexibility
Sheng-Chun Kao, Hyoukjun Kwon, Michael Pellauer, Angshuman Parashar,, Tushar Krishna

TL;DR
This paper introduces a formal framework to quantify and analyze the flexibility of DNN accelerators across multiple design axes, enabling systematic exploration and optimization for evolving neural network workloads.
Contribution
It formally defines accelerator flexibility, categorizes existing designs, and develops a flexibility-aware design-space exploration framework for systematic evaluation.
Findings
Adding flexibility features improves runtime by 11.8x on modern DNNs.
Categorization of accelerators into 16 classes based on flexibility axes.
First formal approach to quantify and analyze DNN accelerator flexibility.
Abstract
The high efficiency of domain-specific hardware accelerators for machine learning (ML) has come from specialization, with the trade-off of less configurability/ flexibility. There is growing interest in developing flexible ML accelerators to make them future-proof to the rapid evolution of Deep Neural Networks (DNNs). However, the notion of accelerator flexibility has always been used in an informal manner, restricting computer architects from conducting systematic apples-to-apples design-space exploration (DSE) across trillions of choices. In this work, we formally define accelerator flexibility and show how it can be integrated for DSE. Specifically, we capture DNN accelerator flexibility across four axes: tiling, ordering, parallelization, and array shape. We categorize existing accelerators into 16 classes based on their axes of flexibility support, and define a precise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Ferroelectric and Negative Capacitance Devices · Advanced Neural Network Applications
