Modeling Data Reuse in Deep Neural Networks by Taking Data-Types into Cognizance

Nandan Kumar Jha; Sparsh Mittal

arXiv:2008.02565·cs.CV·August 7, 2020

TL;DR

This paper introduces a data type aware model to better estimate data reuse and energy efficiency in deep neural networks, addressing limitations of traditional arithmetic intensity metrics.

Contribution

It proposes "data type aware weighted arithmetic intensity" ($DI$), a model that weights the different data types in a DNN unequally and estimates data reuse more accurately than conventional arithmetic intensity.

Findings

1. Accurately models data reuse for various DNN architectures.
2. Better predicts energy efficiency of DNNs on GPU hardware.
3. Demonstrates generality using the central limit theorem.

Abstract

In recent years, researchers have focused on reducing the model size and number of computations (measured as "multiply-accumulate" or MAC operations) of DNNs. The energy consumption of a DNN depends on both the number of MAC operations and the energy efficiency of each MAC operation. The former can be estimated at design time; however, the latter depends on the intricate data reuse patterns and underlying hardware architecture. Hence, estimating it at design time is challenging. This work shows that the conventional approach to estimate the data reuse, viz. arithmetic intensity, does not always correctly estimate the degree of data reuse in DNNs since it gives equal importance to all the data types. We propose a novel model, termed "data type aware weighted arithmetic intensity" ($DI$), which accounts for the unequal importance of different data types in DNNs. We evaluate our model on…
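To make the contrast in the abstract concrete, the following is a minimal sketch of conventional arithmetic intensity for one convolution layer. It assumes the common definition of MACs divided by the total number of weights and activations, with every data element counted equally. The `ConvLayer` class, the stride-1 "same"-padding shape assumption, and the `weighted_intensity` function with its `alpha_w` and `alpha_a` parameters are hypothetical names introduced here for illustration. In particular, `weighted_intensity` only shows what unequal treatment of data types could look like; it is not the paper's $DI$ formula, which the truncated abstract does not give.

```python
from dataclasses import dataclass


@dataclass
class ConvLayer:
    """Hypothetical description of a 2-D convolution layer.

    Assumes stride 1 and 'same' padding, so the input and output
    feature maps share the spatial size (out_h x out_w).
    """
    in_channels: int
    out_channels: int
    kernel: int      # square kernel side length
    out_h: int
    out_w: int
    groups: int = 1  # groups == in_channels gives a depthwise convolution

    def weights(self) -> int:
        # Each output channel has kernel*kernel*(in_channels/groups) weights.
        return self.kernel ** 2 * (self.in_channels // self.groups) * self.out_channels

    def macs(self) -> int:
        # Every weight is applied once per output pixel.
        return self.weights() * self.out_h * self.out_w

    def activations(self) -> int:
        # Input plus output feature-map elements.
        return (self.in_channels + self.out_channels) * self.out_h * self.out_w


def arithmetic_intensity(layer: ConvLayer) -> float:
    """Conventional metric: weights and activations count equally."""
    return layer.macs() / (layer.weights() + layer.activations())


def weighted_intensity(layer: ConvLayer, alpha_w: float = 1.0, alpha_a: float = 1.0) -> float:
    """Illustrative only, NOT the paper's DI: scale each data type by an
    assumed importance factor before forming the ratio."""
    return layer.macs() / (alpha_w * layer.weights() + alpha_a * layer.activations())


standard = ConvLayer(in_channels=64, out_channels=64, kernel=3, out_h=56, out_w=56)
depthwise = ConvLayer(in_channels=64, out_channels=64, kernel=3, out_h=56, out_w=56, groups=64)

print(f"standard  AI: {arithmetic_intensity(standard):.2f}")
print(f"depthwise AI: {arithmetic_intensity(depthwise):.2f}")
```

With both importance factors left at 1.0, `weighted_intensity` reduces to the conventional metric. Changing them shifts how much weight traffic and activation traffic each contribute to the estimate, which is the kind of distinction the abstract argues the conventional metric misses.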

Taxonomy

Methods: Convolution