Calibrating Geophysical Predictions under Constrained Probabilistic Distributions
Zhewen Hou, Jiajin Sun, Subashree Venkatasubramanian, Peter Jin, Shuolin Li, Tian Zheng

TL;DR
This paper presents a novel calibration method using Kernelized Stein Discrepancy to improve machine learning predictions of geophysical systems, ensuring consistency with known physical distributions and long-term attractors.
Contribution
It introduces a distribution-informed calibration framework that enhances ML predictions by incorporating prior knowledge of marginal distributions, especially effective with sparse data.
Findings
Improves prediction fidelity to physical distributions
Ensures consistency with long-term statistical structures
Demonstrates robustness in climatological and flow simulations
Abstract
Machine learning (ML) has shown significant promise in studying complex geophysical dynamical systems, including turbulence and climate processes. Such systems often display sensitive dependence on initial conditions, reflected in positive Lyapunov exponents, where even small perturbations in short-term forecasts can lead to large deviations in long-term outcomes. Thus, meaningful inference requires not only accurate short-term predictions, but also consistency with the system's long-term attractor that is captured by the marginal distribution of state variables. Existing approaches attempt to address this challenge by incorporating spatial and temporal dependence, but these strategies become impractical when data are extremely sparse. In this work, we show that prior knowledge of marginal distributions offers valuable complementary information to short-term observations, motivating a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference
