A comparison of data-driven approaches to build low-dimensional ocean models
Niraj Agarwal, Dmitri Kondrashov, Peter Dueben, Evgenii Ryzhov and, Pavel Berloff

TL;DR
This paper compares linear regression, stochastic, and deep-learning methods for simplified ocean models, finding multi-level stochastic LR and hybrid models most effective for accurate and cost-efficient ocean simulation.
Contribution
It systematically evaluates and compares various statistical modeling approaches for ocean circulation, highlighting the superior performance of multi-level stochastic LR and hybrid models.
Findings
Multi-level stochastic LR performs best for short and long-term forecasts.
Hybrid models with deep learning and noise augmentation outperform pure deep learning.
LR with white noise extension outperforms pure LR on long timescales.
Abstract
We present a comprehensive inter-comparison of linear regression (LR), stochastic, and deep-learning approaches for reduced-order statistical emulation of ocean circulation. The reference dataset is provided by an idealized, eddy-resolving, double-gyre ocean circulation model. Our goal is to conduct a systematic and comprehensive assessment and comparison of skill, cost, and complexity of statistical models from the three methodological classes. The model based on LR is considered as a baseline. Additionally, we investigate its additive white noise augmentation and a multi-level stochastic approach, deep-learning methods, hybrid frameworks (LR plus deep-learning), and simple stochastic extensions of deep-learning and hybrid methods. The assessment metrics considered are: root mean squared error, anomaly cross-correlation, climatology, variance, frequency map, forecast horizon, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
