Transparent FPGA Acceleration with TensorFlow

Simon Pfenning; Philipp Holzinger; Marc Reichenbach

arXiv:2102.06018·cs.AR·February 12, 2021·1 cites

Transparent FPGA Acceleration with TensorFlow

Simon Pfenning, Philipp Holzinger, Marc Reichenbach

PDF

Open Access

TL;DR

This paper presents a flexible FPGA acceleration framework integrated with TensorFlow, enabling dynamic reconfiguration and simplified development for neural network workloads without requiring specialized toolchains.

Contribution

It introduces an HSA-based toolflow that allows FPGA hardware to be dynamically reconfigured during runtime, simplifying neural network acceleration development.

Findings

01

Supports dynamic FPGA reconfiguration during neural network execution

02

Provides a familiar TensorFlow frontend for developers

03

Enables hardware flexibility and runtime adaptability

Abstract

Today, artificial neural networks are one of the major innovators pushing the progress of machine learning. This has particularly affected the development of neural network accelerating hardware. However, since most of these architectures require specialized toolchains, there is a certain amount of additional effort for developers each time they want to make use of a new deep learning accelerator. Furthermore the flexibility of the device is bound to the architecture itself, as well as to the functionality of the runtime environment. In this paper we propose a toolflow using TensorFlow as frontend, thus offering developers the opportunity of using a familiar environment. On the backend we use an FPGA, which is addressable via an HSA runtime environment. In this way we are able to hide the complexity of controlling new hardware from the user, while at the same time maintaining a high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Distributed systems and fault tolerance · Radiation Effects in Electronics