Towards High Performance Java-based Deep Learning Frameworks
Athanasios Stratikopoulos, Juan Fumero, Zoran Sevarac, Christos, Kotselidis

TL;DR
This paper demonstrates how using TornadoVM to accelerate a Java-based deep learning framework can significantly improve training performance on GPUs, making deep learning more efficient and accessible.
Contribution
It introduces a method to transparently accelerate Java-based deep learning frameworks using TornadoVM, reducing the need for hardware-specific code and achieving substantial speedups.
Findings
Up to 8x speedup in training on AMD GPUs
Effective acceleration of back propagation process
Demonstrates feasibility of Java-based deep learning acceleration
Abstract
The advent of modern cloud services along with the huge volume of data produced on a daily basis, have set the demand for fast and efficient data processing. This demand is common among numerous application domains, such as deep learning, data mining, and computer vision. Prior research has focused on employing hardware accelerators as a means to overcome this inefficiency. This trend has driven software development to target heterogeneous execution, and several modern computing systems have incorporated a mixture of diverse computing components, including GPUs and FPGAs. However, the specialization of the applications' code for heterogeneous execution is not a trivial task, as it requires developers to have hardware expertise in order to obtain high performance. The vast majority of the existing deep learning frameworks that support heterogeneous acceleration, rely on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Anomaly Detection Techniques and Applications · Software System Performance and Reliability
