On the choice of training data for machine learning of geostrophic mesoscale turbulence
F. E. Yan, J. Mak, Y. Wang

TL;DR
This paper investigates how the choice of training data, specifically filtering rotational components from eddy fluxes, impacts the robustness and skill of machine learning models in ocean turbulence simulations.
Contribution
It provides theoretical and numerical evidence that filtering rotational components from eddy fluxes improves model robustness and performance in geostrophic turbulence modeling.
Findings
Filtering rotational components enhances model robustness.
Learning from filtered fluxes yields comparable or better predictive skill.
Data choice impacts the ability to discover hidden physical processes.
Abstract
'Data' plays a central role in data-driven methods, but is not often the subject of focus in investigations of machine learning algorithms as applied to Earth System Modeling related problems. Here we consider the case of eddy-mean interaction in rotating stratified turbulence in the presence of lateral boundaries, a problem of relevance to ocean modeling, where the eddy fluxes contain dynamically inert rotational components that are expected to contaminate the learning process. An often utilized choice in the literature is to learn from the divergence of the eddy fluxes. Here we provide theoretical arguments and numerical evidence that learning from the eddy fluxes with the rotational component appropriately filtered out results in models with comparable or better skill, but substantially improved robustness. If we simply want a data-driven model to have predictive skill then the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOceanographic and Atmospheric Processes · Climate variability and models · Reservoir Engineering and Simulation Methods
MethodsFocus
