Symmetric Equilibrium Propagation for Thermodynamic Diffusion Training
Aditi De

TL;DR
This paper introduces symmetric equilibrium propagation for thermodynamic diffusion training, enabling energy-efficient, local, readout-only learning rules that can be physically realized without digital gradient routing.
Contribution
It demonstrates that equilibrium propagation can be applied directly to a bilinear substrate for unbiased gradient estimation, improving bias properties with symmetric nudging and achieving significant energy savings.
Findings
Bias bound controlled by substrate properties
Symmetric nudging reduces bias from O(β) to O(β^2)
Achieves 10^3-10^4× energy advantage over GPU baseline
Abstract
The reverse process in score-based diffusion models is formally equivalent to overdamped Langevin dynamics in a time-dependent energy landscape. In our prior work we showed that a bilinearly-coupled analog substrate can physically realize this dynamics at a projected three-to-four orders of magnitude energy advantage over digital inference by replacing dense skip connections with low-rank inter-module couplings. Whether the \emph{training} loop can be closed on the same substrate -- without routing gradients through an external digital accelerator -- has remained open. We resolve this affirmatively: Equilibrium Propagation applied directly to the bilinear energy yields an unbiased estimator of the denoising score-matching gradient in the zero-nudge limit. For finite nudging we derive a sharp bias bound controlled solely by substrate stiffness, local curvature, and the norm of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
