First Demonstration of Second-order Training of Deep Neural Networks with In-memory Analog Matrix Computing
Saitao Zhang, Yubiao Luo, Shiqing Wang, Pushen Zuo, Yongxiang Li, Lunshuai Pan, Zheng Miao, Zhong Sun

TL;DR
This paper introduces a novel in-memory analog matrix computing approach for second-order neural network training, significantly reducing training epochs and improving energy efficiency compared to traditional methods.
Contribution
The first demonstration of a second-order optimizer using in-memory analog matrix computing with RRAM for efficient neural network training.
Findings
Achieved 26% and 61% fewer epochs than SGD and Adam.
Delivered 5.88x throughput and 6.9x energy efficiency gains.
Validated on CNN for handwritten letter classification.
Abstract
Second-order optimization methods, which leverage curvature information, offer faster and more stable convergence than first-order methods such as stochastic gradient descent (SGD) and Adam. However, their practical adoption is hindered by the prohibitively high cost of inverting the second-order information matrix, particularly in large-scale neural network training. Here, we present the first demonstration of a second-order optimizer powered by in-memory analog matrix computing (AMC) using resistive random-access memory (RRAM), which performs matrix inversion (INV) in a single step. We validate the optimizer by training a two-layer convolutional neural network (CNN) for handwritten letter classification, achieving 26% and 61% fewer training epochs than SGD with momentum and Adam, respectively. On a larger task using the same second-order method, our system delivers a 5.88x improvement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Magnetic properties of thin films
