Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network   Training via Memory-Friendly Pattern Retrieving

Qiyu Wan; Haojun Xia; Xingyao Zhang; Lening Wang; Shuaiwen Leon Song,; Xin Fu

arXiv:2110.03553·cs.AR·October 8, 2021

Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving

Qiyu Wan, Haojun Xia, Xingyao Zhang, Lening Wang, Shuaiwen Leon Song,, Xin Fu

PDF

TL;DR

Shift-BNN introduces a memory-efficient training method for Bayesian Neural Networks by eliminating off-chip data transfer of Gaussian Random Variables, leading to significant improvements in energy efficiency and training speed.

Contribution

This paper presents a novel hardware design that removes off-chip data transfer for BNN training using LFSR reversion, enabling scalable and low-cost BNN accelerators.

Findings

01

Achieves 4.9x average energy efficiency boost

02

Realizes up to 10.8x energy efficiency improvement

03

Provides 1.6x average speedup in BNN training

Abstract

Bayesian Neural Networks (BNNs) that possess a property of uncertainty estimation have been increasingly adopted in a wide range of safety-critical AI applications which demand reliable and robust decision making, e.g., self-driving, rescue robots, medical image diagnosis. The training procedure of a probabilistic BNN model involves training an ensemble of sampled DNN models, which induces orders of magnitude larger volume of data movement than training a single DNN model. In this paper, we reveal that the root cause for BNN training inefficiency originates from the massive off-chip data transfer by Gaussian Random Variables (GRVs). To tackle this challenge, we propose a novel design that eliminates all the off-chip data transfer by GRVs through the reversed shifting of Linear Feedback Shift Registers (LFSRs) without incurring any training accuracy loss. To efficiently support our LFSR…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.