Control of Overfitting with Physics

Sergei V. Kozyrev; Ilya A Lopatin; Alexander N Pechen

arXiv:2412.10716·cs.LG·December 17, 2024


TL;DR

This paper explains overfitting control in machine learning through analogies from physics and biology: the Eyring formula of kinetic theory for stochastic gradient Langevin dynamics, and the predator-prey model for generative adversarial networks (GANs).

Contribution

The paper uses analogies from physics and biology to explain how overfitting arises and how it is controlled in machine learning models.

Findings

01

The Eyring formula of kinetic theory makes it possible to control overfitting in stochastic gradient Langevin dynamics.

02

Wide minima of the risk function with low free energy correspond to low overfitting.

03

An analogy with the predator-prey model in biology explains the selection of wide likelihood maxima and overfitting reduction in GANs.
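
For reference, the first two findings can be read against the textbook one-dimensional Eyring-Kramers law. The form below is the standard one for overdamped Langevin dynamics and is not necessarily the paper's own statement; the symbols $V$ (potential, here the risk), $x^*$ (minimum), $z^*$ (adjacent saddle), and $\varepsilon$ (temperature) are assumptions introduced here for illustration.

```latex
% Overdamped Langevin dynamics in a potential V at temperature \varepsilon
dX_t = -V'(X_t)\,dt + \sqrt{2\varepsilon}\,dW_t

% Mean time to escape from the minimum x^* over the saddle z^*, as \varepsilon \to 0
\mathbb{E}[\tau] \simeq \frac{2\pi}{\sqrt{V''(x^*)\,\lvert V''(z^*)\rvert}}
  \exp\!\left(\frac{V(z^*) - V(x^*)}{\varepsilon}\right)

% Free energy of the well around x^* in the Gaussian approximation
F \approx V(x^*) - \frac{\varepsilon}{2}\,\log\frac{2\pi\varepsilon}{V''(x^*)}
```

A wide minimum has small curvature $V''(x^*)$. This enlarges the prefactor of the escape time and, in the same approximation, lowers the free energy of the well, which is one way to read the link between wide minima and low free energy.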

Abstract

While there are many works on the applications of machine learning, few of them try to understand the theoretical justification for its efficiency. In this work, overfitting control (or the generalization property) in machine learning is explained using analogies from physics and biology. For stochastic gradient Langevin dynamics, we show that the Eyring formula of kinetic theory makes it possible to control overfitting in the algorithmic stability approach, where wide minima of the risk function with low free energy correspond to low overfitting. For the generative adversarial network (GAN) model, we establish an analogy between GAN and the predator-prey model in biology. An application of this analogy allows us to explain the selection of wide likelihood maxima and overfitting reduction for GANs.
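
The Langevin dynamics mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the double-well risk, the function names `risk_grad` and `langevin_descent`, and all parameter values are hypothetical, and the exact gradient is used where stochastic gradient Langevin dynamics would use a minibatch estimate.

```python
import numpy as np


def risk_grad(theta):
    """Gradient of a hypothetical 1D risk V(x) = (x^2 - 1)^2 with minima at x = -1 and x = +1."""
    return 4.0 * theta * (theta**2 - 1.0)


def langevin_descent(theta0, step, temperature, n_steps, seed=0):
    """Discretized Langevin dynamics: a gradient step plus Gaussian noise.

    Update rule: theta <- theta - step * grad + sqrt(2 * step * temperature) * xi,
    where xi is a standard normal sample. With temperature = 0 this reduces
    to plain gradient descent.
    """
    rng = np.random.default_rng(seed)
    theta = float(theta0)
    path = np.empty(n_steps + 1)
    path[0] = theta
    noise_scale = np.sqrt(2.0 * step * temperature)
    for k in range(n_steps):
        theta = theta - step * risk_grad(theta) + noise_scale * rng.standard_normal()
        path[k + 1] = theta
    return path


# Noise-free run: settles into the nearest minimum.
cold_path = langevin_descent(theta0=0.5, step=0.01, temperature=0.0, n_steps=2000)

# Noisy run: fluctuates around a minimum and may occasionally cross the barrier at x = 0.
warm_path = langevin_descent(theta0=0.5, step=0.01, temperature=0.05, n_steps=2000)
```

The temperature sets how often the trajectory escapes a well, which is the quantity the Eyring formula estimates.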
