Investigating Energy Bounds of Analog Compute-in-Memory with Local Normalization
Brian Rojkov, Shubham Ranjan, Derek Wright, Manoj Sachdev

TL;DR
This paper introduces local normalization in analog compute-in-memory architectures, enabling increased dynamic range and energy efficiency for AI workloads, especially large language models, by using a gain-ranging MAC.
Contribution
It proposes a novel gain-ranging MAC with local normalization, reducing ADC resolution dependence and improving energy efficiency in analog CIM for AI applications.
Findings
Enables 4-bit increase in input dynamic range without extra energy
ADC resolution becomes input distribution invariant
Establishes an energy-efficient upper bound for analog CIM
Abstract
Modern edge AI workloads demand maximum energy efficiency, motivating the pursuit of analog Compute-in-Memory (CIM) architectures. Simultaneously, the popularity of Large-Language-Models (LLMs) drives the adoption of low-bit floating-point formats which prioritize dynamic range. However, the conventional direct-accumulation CIM accommodates floating-points by normalizing them to a shared widened fixed-point scale. Consequently, hardware resolution is dictated by the input's dynamic range rather than its precision, and energy consumption is dominated by the ADC. We address this limitation by introducing local normalization for each input, weight, and multiply-accumulate (MAC) output via a Gain-Ranging MAC (GR-MAC). Normalization overhead is handled by low-power digital logic, enabling the computationally expensive MAC operation to remain in the energy-efficient low-precision analog…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Parallel Computing and Optimization Techniques
