Nonequilibrium thermodynamics of self-supervised learning

Domingos S. P. Salazar

arXiv:2106.08981·cond-mat.stat-mech·October 27, 2021

Nonequilibrium thermodynamics of self-supervised learning

Domingos S. P. Salazar

PDF

TL;DR

This paper models self-supervised learning as a nonequilibrium thermodynamic process, revealing how it operates through thermodynamic cycles and feedback mechanisms, and introduces a generalized Gibbs ensemble perspective.

Contribution

It introduces a novel thermodynamic framework for SSL, connecting it to nonequilibrium thermodynamics and generalized Gibbs ensembles, and interprets learning as a feedback-driven thermodynamic cycle.

Findings

01

SSL systems can be modeled as thermodynamic cycles.

02

Learning operates as a feedback cycle extracting negative work.

03

SSL algorithms can be understood through thermodynamic principles.

Abstract

Self-supervised learning (SSL) of energy based models has an intuitive relation to equilibrium thermodynamics because the softmax layer, mapping energies to probabilities, is a Gibbs distribution. However, in what way SSL is a thermodynamic process? We show that some SSL paradigms behave as a thermodynamic composite system formed by representations and self-labels in contact with a nonequilibrium reservoir. Moreover, this system is subjected to usual thermodynamic cycles, such as adiabatic expansion and isochoric heating, resulting in a generalized Gibbs ensemble (GGE). In this picture, we show that learning is seen as a demon that operates in cycles using feedback measurements to extract negative work from the system. As applications, we examine some SSL algorithms using this idea.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDemon · Softmax