On the Accuracy of Analog Neural Network Inference Accelerators
T. Patrick Xiao, Ben Feinberg, Christopher H. Bennett, Venkatraman, Prabhakar, Prashant Saxena, Vineet Agrawal, Sapan Agarwal, Matthew J., Marinella

TL;DR
This paper investigates how architectural choices in analog neural network accelerators affect inference accuracy, showing that proportionality of analog quantities to weights enhances robustness and efficiency, challenging prior assumptions about bit slicing.
Contribution
It demonstrates that proportional mapping of weights to analog quantities improves accuracy and robustness, offering a new design principle for analog accelerators.
Findings
Proportional analog quantities improve inference accuracy.
Analog storage of weights can match digital resilience without bit slicing.
Design flexibility allows matching hardware precision to algorithm needs.
Abstract
Specialized accelerators have recently garnered attention as a method to reduce the power consumption of neural network inference. A promising category of accelerators utilizes nonvolatile memory arrays to both store weights and perform analog computation inside the array. While prior work has explored the design space of analog accelerators to optimize performance and energy efficiency, there is seldom a rigorous evaluation of the accuracy of these accelerators. This work shows how architectural design decisions, particularly in mapping neural network parameters to analog memory cells, influence inference accuracy. When evaluated using ResNet50 on ImageNet, the resilience of the system to analog non-idealities - cell programming errors, analog-to-digital converter resolution, and array parasitic resistances - all improve when analog quantities in the hardware are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
