Optimal DNN Primitive Selection with Partitioned Boolean Quadratic   Programming

Andrew Anderson; David Gregg

arXiv:1710.01079·cs.PF·November 5, 2018

Optimal DNN Primitive Selection with Partitioned Boolean Quadratic Programming

Andrew Anderson, David Gregg

PDF

Open Access

TL;DR

This paper formulates the problem of selecting optimal DNN computational primitives considering data format transformations as a PBQP, providing an analytic solution that improves performance over existing libraries.

Contribution

It models the primitive selection problem as a PBQP, proves its NP-hardness, and offers an analytic solution evaluated on real platforms showing significant performance gains.

Findings

01

Significant performance improvements over vendor libraries.

02

The primitive selection problem is NP-hard and can be modeled as PBQP.

03

The proposed solution is effective on embedded and general-purpose platforms.

Abstract

Deep Neural Networks (DNNs) require very large amounts of computation both for training and for inference when deployed in the field. Many different algorithms have been proposed to implement the most computationally expensive layers of DNNs. Further, each of these algorithms has a large number of variants, which offer different trade-offs of parallelism, data locality, memory footprint, and execution time. In addition, specific algorithms operate much more efficiently on specialized data layouts and formats. We state the problem of optimal primitive selection in the presence of data format transformations, and show that it is NP-hard by demonstrating an embedding in the Partitioned Boolean Quadratic Assignment problem (PBQP). We propose an analytic solution via a PBQP solver, and evaluate our approach experimentally by optimizing several popular DNNs using a library of more than 70…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Memory and Neural Computing · Adversarial Robustness in Machine Learning