Model selection for stochastic dynamics: a parsimonious and principled approach
Andonis Gerardos

TL;DR
This paper introduces PASTIS, a new information criterion based on extreme value theory, for selecting simple, accurate stochastic models from large candidate libraries, outperforming traditional criteria in various benchmarks.
Contribution
The paper presents PASTIS, a novel, statistically grounded model selection method for stochastic differential equations, incorporating a significance threshold and robust variants for noisy or sparse data.
Findings
PASTIS outperforms AIC, BIC, CV, and SINDy in model identification.
Benchmarks include Lorenz, Ornstein-Uhlenbeck, Lotka-Volterra, and Gray-Scott systems.
Robust variants extend applicability to noisy and sparse data.
Abstract
This thesis focuses on the discovery of stochastic differential equations (SDEs) and stochastic partial differential equations (SPDEs) from noisy and discrete time series. A major challenge is selecting the simplest possible correct model from vast libraries of candidate models, where standard information criteria (AIC, BIC) are often limited. We introduce PASTIS (Parsimonious Stochastic Inference), a new information criterion derived from extreme value theory. Its penalty term, , explicitly incorporates the size of the initial library of candidate parameters (), the number of parameters in the considered model (), and a significance threshold (). This significance threshold represents the probability of selecting a model containing more parameters than necessary when comparing many models. Benchmarks on various systems (Lorenz,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
