Extending Straight-Through Estimation for Robust Neural Networks on Analog CIM Hardware

Yuannuo Feng; Wenyong Zhou; Yuexi Lyu; Yixiang Zhang; Zhengwu Liu; Ngai Wong; Wang Kang

arXiv:2508.11940·cs.LG·August 19, 2025

Extending Straight-Through Estimation for Robust Neural Networks on Analog CIM Hardware

Yuannuo Feng, Wenyong Zhou, Yuexi Lyu, Yixiang Zhang, Zhengwu Liu, Ngai Wong, Wang Kang

PDF

Open Access

TL;DR

This paper introduces an extended Straight-Through Estimator framework that improves noise-aware training for analog CIM hardware, leading to better accuracy, efficiency, and robustness in neural network inference.

Contribution

It proposes a novel STE-based approach that decouples noise simulation from gradient computation, enabling more accurate noise modeling during training.

Findings

01

Up to 5.3% accuracy improvement on image classification

02

0.72 perplexity reduction on text generation

03

2.2× training speedup and 37.9% lower peak memory usage

Abstract

Analog Compute-In-Memory (CIM) architectures promise significant energy efficiency gains for neural network inference, but suffer from complex hardware-induced noise that poses major challenges for deployment. While noise-aware training methods have been proposed to address this issue, they typically rely on idealized and differentiable noise models that fail to capture the full complexity of analog CIM hardware variations. Motivated by the Straight-Through Estimator (STE) framework in quantization, we decouple forward noise simulation from backward gradient computation, enabling noise-aware training with more accurate but computationally intractable noise modeling in analog CIM systems. We provide theoretical analysis demonstrating that our approach preserves essential gradient directional information while maintaining computational tractability and optimization stability. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Vision and Imaging