A statistical physics framework for optimal learning

Francesca Mignacco; Francesco Mori

arXiv:2507.07907·cond-mat.dis-nn·July 11, 2025

A statistical physics framework for optimal learning

Francesca Mignacco, Francesco Mori

PDF

Open Access

TL;DR

This paper develops a statistical physics-based framework to identify optimal learning protocols in neural networks, providing insights into designing strategies that improve generalization and efficiency.

Contribution

It introduces a unified control-theoretic approach to derive optimal learning protocols in high-dimensional neural models, bridging theory and practical applications.

Findings

01

Derived closed-form differential equations for online learning dynamics.

02

Identified optimal curricula and regularization schedules.

03

Demonstrated applicability to real datasets.

Abstract

Learning is a complex dynamical process shaped by a range of interconnected decisions. Careful design of hyperparameter schedules for artificial neural networks or efficient allocation of cognitive resources by biological learners can dramatically affect performance. Yet, theoretical understanding of optimal learning strategies remains sparse, especially due to the intricate interplay between evolving meta-parameters and nonlinear learning dynamics. The search for optimal protocols is further hindered by the high dimensionality of the learning space, often resulting in predominantly heuristic, difficult to interpret, and computationally demanding solutions. Here, we combine statistical physics with control theory in a unified theoretical framework to identify optimal protocols in prototypical neural network models. In the high-dimensional limit, we derive closed-form ordinary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference

MethodsDropout · Adaptive Dropout