Learning and extrapolating scale-invariant processes
Anaclara Alvez-Canepa, Cyril Furtlehner, Fran\c{c}ois Landes

TL;DR
This paper investigates how machine learning models can predict rare, large-scale events in scale-invariant processes like earthquakes, focusing on leveraging known symmetries such as scale invariance for extrapolation.
Contribution
The study evaluates various neural network architectures, including novel wavelet and Fourier-based models, for their ability to extrapolate scale-free processes and addresses challenges related to spectral biases.
Findings
Scale-invariant models like Riesz networks perform better in extrapolation.
Spectral biases hinder the ability to predict rare large events.
Incorporating inductive biases improves model extrapolation capabilities.
Abstract
Machine Learning (ML) has deeply changed some fields recently, like Language and Vision and we may expect it to be relevant also to the analysis of of complex systems. Here we want to tackle the question of how and to which extent can one regress scale-free processes, i.e. processes displaying power law behavior, like earthquakes or avalanches? We are interested in predicting the large ones, i.e. rare events in the training set which therefore require extrapolation capabilities of the model. For this we consider two paradigmatic problems that are statistically self-similar. The first one is a 2-dimensional fractional Gaussian field obeying linear dynamics, self-similar by construction and amenable to exact analysis. The second one is the Abelian sandpile model, exhibiting self-organized criticality. The emerging paradigm of Geometric Deep Learning shows that including known symmetries…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Gaussian Processes and Bayesian Inference · Stochastic Gradient Optimization Techniques
