Tensor-Compressed Back-Propagation-Free Training for (Physics-Informed) Neural Networks
Yequan Zhao, Xinling Yu, Zhixiong Chen, Ziyue Liu, Sijia Liu, Zheng, Zhang

TL;DR
This paper introduces a novel BP-free training framework for neural networks, utilizing tensor compression and hybrid gradient evaluation, enabling efficient on-device training without backpropagation, even for physics-informed neural networks.
Contribution
It proposes a tensor-compressed variance reduction method and a hybrid gradient approach for BP-free training, extending to physics-informed neural networks with derivative estimation via sparse grids.
Findings
Achieves comparable accuracy to standard training on MNIST.
Successfully trains a physics-informed neural network for a 20-dimensional PDE.
Enables memory-efficient on-device training on resource-constrained platforms.
Abstract
Backward propagation (BP) is widely used to compute the gradients in neural network training. However, it is hard to implement BP on edge devices due to the lack of hardware and software resources to support automatic differentiation. This has tremendously increased the design complexity and time-to-market of on-device training accelerators. This paper presents a completely BP-free framework that only requires forward propagation to train realistic neural networks. Our technical contributions are three-fold. Firstly, we present a tensor-compressed variance reduction approach to greatly improve the scalability of zeroth-order (ZO) optimization, making it feasible to handle a network size that is beyond the capability of previous ZO approaches. Secondly, we present a hybrid gradient evaluation approach to improve the efficiency of ZO training. Finally, we extend our BP-free training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Computational Physics and Python Applications · Tensor decomposition and applications
