Singular leaning coefficients and efficiency in learning theory

Miki Aoyagi

arXiv:2501.12747·stat.ML·February 12, 2025

Open Access

TL;DR

This paper investigates the theoretical learning efficiency of singular models like neural networks and mixture models by analyzing learning coefficients, extending results to models with ReLU and Softmax activations.

Contribution

It provides new theoretical insights into the learning coefficients of singular models, including deep linear, ReLU, and Softmax neural networks.

Findings

01

Learning coefficients quantify efficiency in singular models.

02

Results extend to ReLU and Softmax neural networks.

03

Theoretical analysis of learning models is advanced.

Abstract

Singular learning models, whose Fisher information matrices are not positive definite, include neural networks, reduced-rank regression, Boltzmann machines, normal mixture models, and others. These models have been widely used in the development of learning machines, but their theoretical analysis is still in its early stages. In this paper, we examine learning coefficients, which indicate the general learning efficiency, for deep linear learning models and three-layer neural network models with ReLU units. Finally, we extend the results to the case of the Softmax function.
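
For orientation, the role of a learning coefficient can be sketched with the standard definitions from singular learning theory (Watanabe's framework). This is background that the abstract does not restate, and the symbols below ($K$, $\varphi$, $\lambda$, $m$, $d$, $S_n$, $F_n$, $G_n$) are notation assumed here rather than taken from the paper. It also assumes the true distribution is realizable by the model.

```latex
% K(w): Kullback-Leibler divergence from the true distribution to the model at parameter w
% \varphi(w): prior density on the parameter space W
% The zeta function below extends meromorphically to the complex plane;
% its largest pole is z = -\lambda, with multiplicity m.
\zeta(z) = \int_W K(w)^{z}\, \varphi(w)\, dw

% Bayesian free energy for n samples (S_n: empirical entropy of the true distribution)
F_n = n S_n + \lambda \log n - (m - 1) \log\log n + O_p(1)

% Leading term of the expected Bayesian generalization error
\mathbb{E}[G_n] = \frac{\lambda}{n} + o\!\left(\frac{1}{n}\right)

% Regular models with d parameters: \lambda = d/2 and m = 1.
% Singular models typically satisfy \lambda \le d/2 under a positive prior.
```

Under these assumptions, a smaller $\lambda$ means a smaller leading-order generalization error, which is the sense in which the coefficient measures learning efficiency.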

Taxonomy

Topics: Functional Equations Stability Results

Methods: Softmax