Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Klemens Iten; Bruce Lee; Chenhao Li; Lenart Treven; Andreas Krause; Bhavya Sukhija

arXiv:2604.02260·cs.LG·April 3, 2026

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Klemens Iten, Bruce Lee, Chenhao Li, Lenart Treven, Andreas Krause, Bhavya Sukhija

PDF

TL;DR

This paper addresses reinforcement learning for control in systems with changing dynamics, proposing a new algorithm that adapts to non-stationarity and improves performance on benchmarks.

Contribution

It introduces a practical optimistic model-based RL algorithm with adaptive data buffers for non-stationary environments, backed by theoretical analysis.

Findings

01

The proposed method outperforms existing algorithms on non-stationary control benchmarks.

02

Explicitly limiting outdated data improves uncertainty calibration and regret guarantees.

03

Gaussian process models effectively handle time-varying dynamics in RL.

Abstract

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under time-varying dynamics. We consider a continual model-based reinforcement learning setting in which an agent repeatedly learns and controls a dynamical system whose transition dynamics evolve across episodes. We analyze the problem using Gaussian process dynamics models under frequentist variation-budget assumptions. Our analysis shows that persistent non-stationarity requires explicitly limiting the influence of outdated data to maintain calibrated uncertainty and meaningful dynamic regret guarantees. Motivated by these insights, we propose a practical optimistic model-based reinforcement learning algorithm with adaptive data buffer mechanisms and demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.