Fundamental limits to learning closed-form mathematical models from data

Oscar Fajardo-Fontiveros; Ignasi Reichardt; Harry R. De Los Rios,; Jordi Duch; Marta Sales-Pardo; Roger Guimera

arXiv:2204.02704·cs.LG·March 22, 2023·1 cites

Fundamental limits to learning closed-form mathematical models from data

Oscar Fajardo-Fontiveros, Ignasi Reichardt, Harry R. De Los Rios,, Jordi Duch, Marta Sales-Pardo, Roger Guimera

PDF

Open Access

TL;DR

This paper investigates the fundamental limits of learning closed-form mathematical models from noisy data, revealing a phase transition between learnability and unlearnability and highlighting the effectiveness of probabilistic model selection.

Contribution

It characterizes the phase transition in model learning from noisy data and compares the performance of probabilistic model selection with standard machine learning methods.

Findings

01

A phase transition exists between learnable and unlearnable regimes.

02

Probabilistic model selection achieves optimal generalization in both phases.

03

Standard neural networks are limited in the low-noise phase by their interpolation ability.

Abstract

Given a finite and noisy dataset generated with a closed-form mathematical model, when is it possible to learn the true generating model from the data alone? This is the question we investigate here. We show that this model-learning problem displays a transition from a low-noise phase in which the true model can be learned, to a phase in which the observation noise is too high for the true model to be learned by any method. Both in the low-noise phase and in the high-noise phase, probabilistic model selection leads to optimal generalization to unseen data. This is in contrast to standard machine learning approaches, including artificial neural networks, which in this particular problem are limited, in the low-noise phase, by their ability to interpolate. In the transition region between the learnable and unlearnable phases, generalization is hard for all approaches including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReservoir Engineering and Simulation Methods · Neural Networks and Applications · Model Reduction and Neural Networks