Towards a learning-based performance modeling for accelerating Deep   Neural Networks

Damiano Perri; Paolo Sylos Labini; Osvaldo Gervasi; Sergio Tasso,; Flavio Vella

arXiv:2212.05031·cs.LG·December 12, 2022

Towards a learning-based performance modeling for accelerating Deep Neural Networks

Damiano Perri, Paolo Sylos Labini, Osvaldo Gervasi, Sergio Tasso,, Flavio Vella

PDF

TL;DR

This paper explores machine learning-based performance models to optimize CNN computations, demonstrating that predictive models can outperform manually optimized convolution operators on ARM Mali GPUs.

Contribution

It introduces a machine learning approach to performance modeling for CNNs, specifically applying decision trees and Bayesian classifiers to improve operator selection.

Findings

01

Predictive models outperform manual operator selection.

02

Models built using decision trees and Bayesian classifiers.

03

Validation on ARM Mali GPU shows improved performance.

Abstract

Emerging applications such as Deep Learning are often data-driven, thus traditional approaches based on auto-tuners are not performance effective across the wide range of inputs used in practice. In the present paper, we start an investigation of predictive models based on machine learning techniques in order to optimize Convolution Neural Networks (CNNs). As a use-case, we focus on the ARM Compute Library which provides three different implementations of the convolution operator at different numeric precision. Starting from a collation of benchmarks, we build and validate models learned by Decision Tree and naive Bayesian classifier. Preliminary experiments on Midgard-based ARM Mali GPU show that our predictive model outperforms all the convolution operators manually selected by the library.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLib · Convolution