A Gaussian Comparison Theorem for Training Dynamics in Machine Learning

Ashkan Panahi

arXiv:2603.09310·cs.LG·March 11, 2026

A Gaussian Comparison Theorem for Training Dynamics in Machine Learning

Ashkan Panahi

PDF

Open Access

TL;DR

This paper introduces a Gaussian comparison theorem to analyze training dynamics in machine learning models, providing non-asymptotic results and refinement schemes for better understanding of model evolution.

Contribution

It presents a novel non-asymptotic analysis connecting training dynamics to a surrogate system using Gordon’s theorem, and extends dynamic mean-field theory to non-asymptotic settings.

Findings

01

Validated the dynamic mean-field expressions in asymptotic regimes

02

Developed an iterative scheme for non-asymptotic accuracy

03

Analyzed perceptron training with fluctuation parameters

Abstract

We study training algorithms with data following a Gaussian mixture model. For a specific family of such algorithms, we present a non-asymptotic result, connecting the evolution of the model to a surrogate dynamical system, which can be easier to analyze. The proof of our result is based on the celebrated Gordon comparison theorem. Using our theorem, we rigorously prove the validity of the dynamic mean-field (DMF) expressions in the asymptotic scenarios. Moreover, we suggest an iterative refinement scheme to obtain more accurate expressions in non-asymptotic scenarios. We specialize our theory to the analysis of training a perceptron model with a generic first-order (full-batch) algorithm and demonstrate that fluctuation parameters in a non-asymptotic domain emerge in addition to the DMF kernels.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Gaussian Processes and Bayesian Inference · Neural Networks and Applications