Controlling Transient Amplification Improves Long-horizon Rollouts

Adeel Pervez; Francesco Locatello

arXiv:2605.08856·cs.LG·May 18, 2026

Controlling Transient Amplification Improves Long-horizon Rollouts

Adeel Pervez, Francesco Locatello

PDF

TL;DR

This paper identifies transient amplification of errors as a key factor limiting long-horizon predictions in neural simulators and proposes a regularization method to mitigate this, enabling more accurate extended rollouts.

Contribution

It introduces commutativity regularization to reduce error amplification in autoregressive models, improving long-term simulation accuracy.

Findings

01

Regularization improves long-horizon predictions in synthetic and real data.

02

Models trained with regularization maintain accuracy over thousands of steps.

03

Enhanced climate forecast accuracy, especially out-of-distribution, without additional data.

Abstract

Autoregressive neural simulators now match classical solvers on short-horizon prediction of physical systems, yet their accuracy degrades rapidly when rolled out over long horizons. In this work, we identify transient amplification of perturbations around rollout trajectories as a structural mechanism driving rollout error. Using a linearization analysis we show that when the Jacobians along an autoregressive trajectory are non-normal and non-commuting, the model amplifies errors transiently, resulting in model rollout drift even when the overall system is asymptotically stable. Building on the analysis, we propose commutativity regularization: a combination of two penalties designed to reduce the normality defect of individual Jacobians and the commutator norm of Jacobians across steps. The penalties are estimated with Jacobian-vector products and have no inference-time cost. We show a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.