Bayes2IMC: In-Memory Computing for Bayesian Binary Neural Networks
Prabodh Katti, Clement Ruah, Osvaldo Simeone, Bashir M. Al-Hashimi, Bipin Rajendran

TL;DR
Bayes2IMC introduces an in-memory computing architecture using phase-change memory to efficiently implement Bayesian binary neural networks, reducing resource consumption while maintaining accuracy.
Contribution
It presents a novel PCM-based IMC design for Bayesian BNNs, eliminating the need for ADCs and introducing hardware-software co-optimization techniques.
Findings
Achieves comparable accuracy to software implementations on CIFAR-10
Provides 3.8 to 9.6 times improvement in power and area efficiency
Surpasses existing memristive device-based BNN architectures in hardware performance
Abstract
Bayesian Neural Networks (BNNs) provide superior estimates of uncertainty by generating an ensemble of predictive distributions. However, inference via ensembling is resource-intensive, requiring additional entropy sources to generate stochasticity which increases resource consumption. We introduce Bayes2IMC, an in-memory computing (IMC) architecture designed for binary Bayesian neural networks that leverage nanoscale device stochasticity to generate desired distributions. Our novel approach utilizes Phase-Change Memory (PCM) to harness inherent noise characteristics, enabling the creation of a binary neural network. This design eliminates the necessity for a pre-neuron Analog-to-Digital Converter (ADC), significantly improving power and area efficiency. We also develop a hardware-software co-optimized correction method applied solely on the logits in the final layer to reduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Advanced Memory and Neural Computing
