HALF: Holistic Auto Machine Learning for FPGAs

Jonas Ney; Dominik Loroch; Vladimir Rybalkin; Nico Weber; Jens; Kr\"uger; Norbert Wehn

arXiv:2106.14771·cs.AR·October 22, 2021

HALF: Holistic Auto Machine Learning for FPGAs

Jonas Ney, Dominik Loroch, Vladimir Rybalkin, Nico Weber, Jens, Kr\"uger, Norbert Wehn

PDF

TL;DR

This paper introduces HALF, a holistic automated framework for designing optimized DNN implementations on FPGAs, considering multiple design layers and objectives to improve performance and energy efficiency.

Contribution

The paper presents a novel cross-layer exploration methodology and an automated framework, HALF, for optimizing FPGA-based DNN implementations across various design goals.

Findings

01

HALF outperforms TensorRT on Nvidia Jetson in throughput.

02

HALF achieves lower energy consumption for medical DNN applications.

03

The framework effectively balances multiple design objectives.

Abstract

Deep Neural Networks (DNNs) are capable of solving complex problems in domains related to embedded systems, such as image and natural language processing. To efficiently implement DNNs on a specific FPGA platform for a given cost criterion, e.g. energy efficiency, an enormous amount of design parameters has to be considered from the topology down to the final hardware implementation. Interdependencies between the different design layers have to be taken into account and explored efficiently, making it hardly possible to find optimized solutions manually. An automatic, holistic design approach can improve the quality of DNN implementations on FPGA significantly. To this end, we present a cross-layer design space exploration methodology. It comprises optimizations starting from a hardware-aware topology search for DNNs down to the final optimized implementation for a given FPGA platform.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.