NeFT: Negative Feedback Training to Improve Robustness of Compute-In-Memory DNN Accelerators
Yifan Qin, Zheyu Yan, Dailin Gan, Jun Xia, Zixuan Pan, Wujie Wen, Xiaobo Sharon Hu, Yiyu Shi

TL;DR
NeFT introduces a control theory-inspired training method that enhances the robustness of compute-in-memory DNN accelerators against device variations, significantly improving inference accuracy and confidence.
Contribution
The paper proposes Negative Feedback Training (NeFT), a novel approach that better captures noisy information during training, outperforming existing methods in robustness against device variations.
Findings
Up to 45.08% accuracy improvement
Reduced epistemic uncertainty and increased confidence
Enhanced convergence probability in DNN inference
Abstract
Compute-in-memory accelerators built upon non-volatile memory devices excel in energy efficiency and latency when performing deep neural network (DNN) inference, thanks to their in-situ data processing capability. However, the stochastic nature and intrinsic variations of non-volatile memory devices often result in performance degradation during DNN inference. Introducing these non-ideal device behaviors in DNN training enhances robustness, but drawbacks include limited accuracy improvement, reduced prediction confidence, and convergence issues. This arises from a mismatch between the deterministic training and non-deterministic device variations, as such training, though considering variations, relies solely on the model's final output. In this work, inspired by control theory, we propose Negative Feedback Training (NeFT), a novel concept supported by theoretical analysis, to more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Stochastic Gradient Optimization Techniques
