Statistical learning theory of structured data

Mauro Pastore; Pietro Rotondo; Vittorio Erba; Marco Gherardi

arXiv:2005.10002·cond-mat.stat-mech·October 21, 2020

Statistical learning theory of structured data

Mauro Pastore, Pietro Rotondo, Vittorio Erba, Marco Gherardi

PDF

TL;DR

This paper bridges statistical physics and learning theory to analyze how structured data affects the generalization of models like kernel machines, revealing nonmonotonic entropy behavior and phase transitions linked to data geometry.

Contribution

It introduces a framework integrating physical data models into statistical learning theory and computes Vapnik-Chervonenkis entropy for structured data, highlighting new phenomena.

Findings

01

Entropy is nonmonotonic with sample size for structured data.

02

Data structure induces a transition beyond storage capacity.

03

Low generalization error is associated with the transition point.

Abstract

The traditional approach of statistical physics to supervised learning routinely assumes unrealistic generative models for the data: usually inputs are independent random variables, uncorrelated with their labels. Only recently, statistical physicists started to explore more complex forms of data, such as equally-labelled points lying on (possibly low dimensional) object manifolds. Here we provide a bridge between this recently-established research area and the framework of statistical learning theory, a branch of mathematics devoted to inference in machine learning. The overarching motivation is the inadequacy of the classic rigorous results in explaining the remarkable generalization properties of deep learning. We propose a way to integrate physical models of data into statistical learning theory, and address, with both combinatorial and statistical mechanics methods, the computation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.