Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator
Martin Ferianc, Zhiqiang Que, Hongxiang Fan, Wayne Luk, Miguel, Rodrigues

TL;DR
This paper presents an FPGA-based hardware acceleration framework for Bayesian LSTM RNNs, significantly improving speed and energy efficiency for healthcare applications, addressing the computational challenges of Bayesian neural networks.
Contribution
It introduces the first FPGA-based acceleration design for Bayesian RNNs and a co-design framework for optimizing algorithm-hardware configurations.
Findings
Up to 10x speedup over GPU implementation
Nearly 106x higher energy efficiency
Effective application in healthcare scenarios
Abstract
Neural networks have demonstrated their outstanding performance in a wide range of tasks. Specifically recurrent architectures based on long-short term memory (LSTM) cells have manifested excellent capability to model time dependencies in real-world data. However, standard recurrent architectures cannot estimate their uncertainty which is essential for safety-critical applications such as in medicine. In contrast, Bayesian recurrent neural networks (RNNs) are able to provide uncertainty estimation with improved accuracy. Nonetheless, Bayesian RNNs are computationally and memory demanding, which limits their practicality despite their advantages. To address this issue, we propose an FPGA-based hardware design to accelerate Bayesian LSTM-based RNNs. To further improve the overall algorithmic-hardware performance, a co-design framework is proposed to explore the most fitting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
