Controlling Transient Amplification Improves Long-horizon Rollouts
Adeel Pervez, Francesco Locatello

TL;DR
This paper identifies transient amplification of errors as a key factor limiting long-horizon predictions in neural simulators and proposes a regularization method to mitigate this, enabling more accurate extended rollouts.
Contribution
It introduces commutativity regularization to reduce error amplification in autoregressive models, improving long-term simulation accuracy.
Findings
Regularization improves long-horizon predictions in synthetic and real data.
Models trained with regularization maintain accuracy over thousands of steps.
Enhanced climate forecast accuracy, especially out-of-distribution, without additional data.
Abstract
Autoregressive neural simulators now match classical solvers on short-horizon prediction of physical systems, yet their accuracy degrades rapidly when rolled out over long horizons. In this work, we identify transient amplification of perturbations around rollout trajectories as a structural mechanism driving rollout error. Using a linearization analysis we show that when the Jacobians along an autoregressive trajectory are non-normal and non-commuting, the model amplifies errors transiently, resulting in model rollout drift even when the overall system is asymptotically stable. Building on the analysis, we propose commutativity regularization: a combination of two penalties designed to reduce the normality defect of individual Jacobians and the commutator norm of Jacobians across steps. The penalties are estimated with Jacobian-vector products and have no inference-time cost. We show a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
