climt-paraformer: Stable Emulation of Convective Parameterization using a Temporal Memory-aware Transformer
Shuochen Wang, Nishant Yadav, Joy Merwin Monteiro, Auroop R. Ganguly

TL;DR
This paper introduces a temporal memory-aware Transformer emulator for convective parameterization in climate models, capturing temporal dependencies to improve accuracy and stability over long simulations.
Contribution
It develops a novel Transformer-based emulator that explicitly models temporal dependencies, outperforming memory-less and recurrent models in climate simulations.
Findings
Transformer achieves lower offline errors than baseline models.
Optimal memory length is around 100 minutes for best performance.
The emulator remains stable over 10-year long-term simulations.
Abstract
Accurate representation of moist convective sub-grid-scale processes remains a major challenge in global climate models, as traditional parameterization schemes are both computationally expensive and difficult to scale. Neural network (NN) emulators offer a promising alternative by learning efficient mappings between atmospheric states and convective tendencies while retaining fidelity to the underlying physics. However, most existing NN-based parameterizations are memory-less and rely only on instantaneous inputs, even though convection evolves over time and depends on prior atmospheric states. Recent studies have begun to incorporate convective memory, but they often treat past states as independent features rather than modeling temporal dependencies explicitly. In this work, we develop a temporal memory-aware Transformer emulator for the Emanuel convective parameterization and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
