Extending Straight-Through Estimation for Robust Neural Networks on Analog CIM Hardware
Yuannuo Feng, Wenyong Zhou, Yuexi Lyu, Yixiang Zhang, Zhengwu Liu, Ngai Wong, Wang Kang

TL;DR
This paper introduces an extended Straight-Through Estimator framework that improves noise-aware training for analog CIM hardware, leading to better accuracy, efficiency, and robustness in neural network inference.
Contribution
It proposes a novel STE-based approach that decouples noise simulation from gradient computation, enabling more accurate noise modeling during training.
Findings
Up to 5.3% accuracy improvement on image classification
0.72 perplexity reduction on text generation
2.2× training speedup and 37.9% lower peak memory usage
Abstract
Analog Compute-In-Memory (CIM) architectures promise significant energy efficiency gains for neural network inference, but suffer from complex hardware-induced noise that poses major challenges for deployment. While noise-aware training methods have been proposed to address this issue, they typically rely on idealized and differentiable noise models that fail to capture the full complexity of analog CIM hardware variations. Motivated by the Straight-Through Estimator (STE) framework in quantization, we decouple forward noise simulation from backward gradient computation, enabling noise-aware training with more accurate but computationally intractable noise modeling in analog CIM systems. We provide theoretical analysis demonstrating that our approach preserves essential gradient directional information while maintaining computational tractability and optimization stability. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Vision and Imaging
