Optimizing Bayesian Recurrent Neural Networks on an FPGA-based   Accelerator

Martin Ferianc; Zhiqiang Que; Hongxiang Fan; Wayne Luk; Miguel; Rodrigues

arXiv:2106.06048·cs.LG·November 9, 2021

Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator

Martin Ferianc, Zhiqiang Que, Hongxiang Fan, Wayne Luk, Miguel, Rodrigues

PDF

TL;DR

This paper presents an FPGA-based hardware acceleration framework for Bayesian LSTM RNNs, significantly improving speed and energy efficiency for healthcare applications, addressing the computational challenges of Bayesian neural networks.

Contribution

It introduces the first FPGA-based acceleration design for Bayesian RNNs and a co-design framework for optimizing algorithm-hardware configurations.

Findings

01

Up to 10x speedup over GPU implementation

02

Nearly 106x higher energy efficiency

03

Effective application in healthcare scenarios

Abstract

Neural networks have demonstrated their outstanding performance in a wide range of tasks. Specifically recurrent architectures based on long-short term memory (LSTM) cells have manifested excellent capability to model time dependencies in real-world data. However, standard recurrent architectures cannot estimate their uncertainty which is essential for safety-critical applications such as in medicine. In contrast, Bayesian recurrent neural networks (RNNs) are able to provide uncertainty estimation with improved accuracy. Nonetheless, Bayesian RNNs are computationally and memory demanding, which limits their practicality despite their advantages. To address this issue, we propose an FPGA-based hardware design to accelerate Bayesian LSTM-based RNNs. To further improve the overall algorithmic-hardware performance, a co-design framework is proposed to explore the most fitting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.