Towards High Performance Java-based Deep Learning Frameworks

Athanasios Stratikopoulos; Juan Fumero; Zoran Sevarac; Christos; Kotselidis

arXiv:2001.04206·cs.LG·January 14, 2020·1 cites

Towards High Performance Java-based Deep Learning Frameworks

Athanasios Stratikopoulos, Juan Fumero, Zoran Sevarac, Christos, Kotselidis

PDF

Open Access

TL;DR

This paper demonstrates how using TornadoVM to accelerate a Java-based deep learning framework can significantly improve training performance on GPUs, making deep learning more efficient and accessible.

Contribution

It introduces a method to transparently accelerate Java-based deep learning frameworks using TornadoVM, reducing the need for hardware-specific code and achieving substantial speedups.

Findings

01

Up to 8x speedup in training on AMD GPUs

02

Effective acceleration of back propagation process

03

Demonstrates feasibility of Java-based deep learning acceleration

Abstract

The advent of modern cloud services along with the huge volume of data produced on a daily basis, have set the demand for fast and efficient data processing. This demand is common among numerous application domains, such as deep learning, data mining, and computer vision. Prior research has focused on employing hardware accelerators as a means to overcome this inefficiency. This trend has driven software development to target heterogeneous execution, and several modern computing systems have incorporated a mixture of diverse computing components, including GPUs and FPGAs. However, the specialization of the applications' code for heterogeneous execution is not a trivial task, as it requires developers to have hardware expertise in order to obtain high performance. The vast majority of the existing deep learning frameworks that support heterogeneous acceleration, rely on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Anomaly Detection Techniques and Applications · Software System Performance and Reliability