Perturbative Gradient Training: A novel training paradigm for bridging the gap between deep neural networks and physical reservoir computing
Cliff B. Abbott, Mark Elo, and Dmytro A. Bozhko

TL;DR
Perturbative Gradient Training (PGT) introduces a physics-inspired method to enable gradient-based training of physical reservoirs without backpropagation, facilitating integration into neural networks and improving energy efficiency.
Contribution
This paper presents PGT, a new training paradigm that allows gradient updates using only forward passes, overcoming limitations of physical reservoir computing.
Findings
PGT achieves comparable performance to backpropagation in neural networks.
Demonstrated on simulated architectures and physical hardware.
Enables training of physical reservoirs in deep neural networks.
Abstract
We introduce Perturbative Gradient Training (PGT), a novel training paradigm that overcomes a critical limitation of physical reservoir computing: the inability to perform backpropagation due to the black-box nature of physical reservoirs. Drawing inspiration from perturbation theory in physics, PGT uses random perturbations in the network's parameter space to approximate gradient updates using only forward passes. We demonstrate the feasibility of this approach on both simulated neural network architectures, including a dense network and a transformer model with a reservoir layer, and on experimental hardware using a magnonic auto-oscillation ring as the physical reservoir. Our results show that PGT can achieve performance comparable to that of standard backpropagation methods in cases where backpropagation is impractical or impossible. PGT represents a promising step toward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Model Reduction and Neural Networks · Quantum many-body systems
