Mean-field Analysis of Generalization Errors

Gholamali Aminian; Samuel N. Cohen; {\L}ukasz Szpruch

arXiv:2306.11623·stat.ML·June 21, 2023·1 cites

Mean-field Analysis of Generalization Errors

Gholamali Aminian, Samuel N. Cohen, {\L}ukasz Szpruch

PDF

Open Access

TL;DR

This paper introduces a new framework using differential calculus on probability measures to analyze generalization errors, establishing conditions for an $ ext{O}(1/n)$ convergence rate in regularized empirical risk minimization, especially for neural networks.

Contribution

It develops a novel mathematical framework for understanding generalization errors via probability measure calculus and applies it to neural networks in the mean-field regime.

Findings

01

Generalization error converges at rate $ ext{O}(1/n)$ under certain conditions.

02

Framework applies to KL-regularized empirical risk minimization.

03

Conditions involve integrability and regularity of loss and activation functions.

Abstract

We propose a novel framework for exploring weak and $L_{2}$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$ , is $O (1/ n)$ . In the context of supervised learning with a one-hidden layer neural network in the mean-field regime, these conditions are reflected in suitable integrability and regularity assumptions on the loss and activation functions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Machine Learning and ELM