# Accelerated Neural Networks on OpenCL Devices Using SYCL-DNN

**Authors:** Rod Burns, John Lawson, Duncan McBain, Daniel Soutar

arXiv: 1904.04174 · 2019-04-09

## TL;DR

This paper introduces SYCL-DNN, an open-source, hardware-agnostic library built on SYCL for accelerating neural network routines across diverse hardware platforms, demonstrating high performance on various accelerators.

## Contribution

The paper presents SYCL-DNN, a new C++ library that provides hardware-agnostic acceleration for neural network operations using SYCL, enabling broad hardware compatibility and high performance.

## Key findings

- High performance achieved on AMD, Intel, and ARM OpenCL devices.
- Library demonstrates hardware and vendor agnostic neural network acceleration.
- Performance figures show competitive results across diverse hardware.

## Abstract

Over the past few years machine learning has seen a renewed explosion of interest, following a number of studies showing the effectiveness of neural networks in a range of tasks which had previously been considered incredibly hard. Neural networks' effectiveness in the fields of image recognition and natural language processing stems primarily from the vast amounts of data available to companies and researchers, coupled with the huge amounts of compute power available in modern accelerators such as GPUs, FPGAs and ASICs. There are a number of approaches available to developers for utilizing GPGPU technologies such as SYCL, OpenCL and CUDA, however many applications require the same low level mathematical routines. Libraries dedicated to accelerating these common routines allow developers to easily make full use of the available hardware without requiring low level knowledge of the hardware themselves, however such libraries are often provided by hardware manufacturers for specific hardware such as cuDNN for Nvidia hardware or MIOpen for AMD hardware.   SYCL-DNN is a new open-source library dedicated to providing accelerated routines for neural network operations which are hardware and vendor agnostic. Built on top of the SYCL open standard and written entirely in standard C++, SYCL-DNN allows a user to easily accelerate neural network code for a wide range of hardware using a modern C++ interface. The library is tested on AMD's OpenCL for GPU, Intel's OpenCL for CPU and GPU, ARM's OpenCL for Mali GPUs as well as ComputeAorta's OpenCL for R-Car CV engine and host CPU. In this talk we will present performance figures for SYCL-DNN on this range of hardware, and discuss how high performance was achieved on such a varied set of accelerators with such different hardware features.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.04174/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/1904.04174/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/1904.04174/full.md

---
Source: https://tomesphere.com/paper/1904.04174