Thermodynamic Irreversibility of Training Algorithms
Liu Ziyin, Yuanjie Ren, Adam Levine, Isaac Chuang

TL;DR
This paper develops a comprehensive framework to analyze the irreversibility of training algorithms in AI, linking thermodynamic concepts with learning dynamics.
Contribution
It introduces a unified approach to characterize irreversibility in training algorithms, connecting four different measures and revealing their equivalence.
Findings
Irreversibility characterized by four equivalent measures to leading order in step size
Irreversibility induces a time-reversal-symmetry-breaking force in learning dynamics
Preference for trajectories minimizing entropy production rate
Abstract
The training algorithms for AI systems all introduce far-from-equilibrium dynamical processes, and understanding the irreversibility of these algorithms is a fundamental step towards understanding the learning dynamics of modern AI systems. In this work, we establish a general framework for defining and analyzing the irreversibility of training algorithms. We show that four different ways to characterize the irreversibility of dynamical processes are equivalent to leading order in the step size : numerical backward error , time-renormalized correction , microscopic time reversal asymmetry , and the (regularized) stochastic-thermodynamic entropy production . The irreversibility gives rise to a time-reversal-symmetry-breaking emergent force that generically breaks non-isometric continuous reparametrization symmetries,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
