$MC^2RAM$: Markov Chain Monte Carlo Sampling in SRAM for Fast Bayesian Inference
Priyesh Shukla, Ahish Shylendra, Theja Tulabandhula, and Amit Ranjan, Trivedi

TL;DR
This paper introduces a novel SRAM-based architecture for high-speed MCMC sampling of Gaussian mixture models, integrating RNGs, DACs, and ADCs to enable parallel, low-power Bayesian inference.
Contribution
It presents a new SRAM design embedding analog and digital components for efficient MCMC sampling, reducing data movement and improving speed and power efficiency.
Findings
Consumes ~91 micro-Watts per sample at 1 GHz
Generates 500 samples in 2000 cycles
Highlights impact of hardware non-idealities on sampling quality
Abstract
This work discusses the implementation of Markov Chain Monte Carlo (MCMC) sampling from an arbitrary Gaussian mixture model (GMM) within SRAM. We show a novel architecture of SRAM by embedding it with random number generators (RNGs), digital-to-analog converters (DACs), and analog-to-digital converters (ADCs) so that SRAM arrays can be used for high performance Metropolis-Hastings (MH) algorithm-based MCMC sampling. Most of the expensive computations are performed within the SRAM and can be parallelized for high speed sampling. Our iterative compute flow minimizes data movement during sampling. We characterize power-performance trade-off of our design by simulating on 45 nm CMOS technology. For a two-dimensional, two mixture GMM, the implementation consumes ~ 91 micro-Watts power per sampling iteration and produces 500 samples in 2000 clock cycles on an average at 1 GHz clock frequency.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Target Tracking and Data Fusion in Sensor Networks
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
