Sparsity-Aware Optimization of In-Memory Bayesian Binary Neural Network   Accelerators

Prabodh Katti; Bashir M. Al-Hashimi; Bipin Rajendran

arXiv:2411.07842·cs.ET·November 13, 2024

Sparsity-Aware Optimization of In-Memory Bayesian Binary Neural Network Accelerators

Prabodh Katti, Bashir M. Al-Hashimi, Bipin Rajendran

PDF

Open Access

TL;DR

This paper introduces a sparsity-aware optimization for Bayesian Binary Neural Network accelerators that exploits sampling sparsity to significantly reduce resource consumption without sacrificing accuracy, enabling more efficient edge AI deployment.

Contribution

The paper proposes a novel sparsity-aware optimization scheme for BBNN accelerators that leverages inherent sampling sparsity to reduce resource usage by up to 86%, with no accuracy loss.

Findings

01

Up to 86% reduction in sampled parameters.

02

Up to 5.3x reduction in area and 8.8x in energy.

03

Achieves 2.9x higher power efficiency than state-of-the-art.

Abstract

Bayesian Neural Networks (BNNs) provide principled estimates of model and data uncertainty by encoding parameters as distributions. This makes them key enablers for reliable AI that can be deployed on safety critical edge systems. These systems can be made resource efficient by restricting synapses to two synaptic states ${- 1, + 1}$ and using a memristive in-memory computing (IMC) paradigm. However, BNNs pose an additional challenge -- they require multiple instantiations for ensembling, consuming extra resources in terms of energy and area. In this work, we propose a novel sparsity-aware optimization for Bayesian Binary Neural Network (BBNN) accelerators that exploits the inherent BBNN sampling sparsity -- most of the network is made up of synapses that have a high probability of being fixed at $\pm 1$ and require no sampling. The optimization scheme proposed here exploits the sampling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Fault Detection and Control Systems · Machine Learning and ELM