Optimising the Performance of Convolutional Neural Networks across   Computing Systems using Transfer Learning

Rik Mulder; Valentin Radu; Christophe Dubach

arXiv:2010.10621·cs.LG·October 22, 2020

Optimising the Performance of Convolutional Neural Networks across Computing Systems using Transfer Learning

Rik Mulder, Valentin Radu, Christophe Dubach

PDF

Open Access

TL;DR

This paper introduces a transfer learning-based performance modeling approach to optimize convolutional neural network inference across different hardware platforms, significantly reducing profiling time from hours to seconds.

Contribution

It presents a novel machine learning method that replaces extensive profiling with transfer learning, enabling rapid and transferable performance estimation for neural network primitives.

Findings

01

Performance modeling reduces optimization time from hours to seconds.

02

Transfer learning enables model adaptation to different hardware with minimal data.

03

The approach maintains high accuracy across diverse computing platforms.

Abstract

The choice of convolutional routines (primitives) to implement neural networks has a tremendous impact on their inference performance (execution speed) on a given hardware platform. To optimise a neural network by primitive selection, the optimal primitive is identified for each layer of the network. This process requires a lengthy profiling stage, iterating over all the available primitives for each layer configuration, to measure their execution time on the target platform. Because each primitive exploits the hardware in different ways, new profiling is needed to obtain the best performance when moving to another platform. In this work, we propose to replace this prohibitively expensive profiling stage with a machine learning based approach of performance modeling. Our approach speeds up the optimisation time drastically. After training, our performance model can estimate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification