Online Gaussian Process State-Space Model: Learning and Planning for Partially Observable Dynamical Systems
Soon-Seo Park, Young-Jin Park, Youngjae Min, Han-Lim Choi

TL;DR
This paper introduces an online learning approach for Gaussian process state-space models that efficiently updates with sequential data, enabling adaptive modeling and planning in partially observable dynamical systems.
Contribution
It presents a novel online variational inference method for GP-SSMs that reduces computational costs and supports adaptation, extending their application to reinforcement learning.
Findings
Effective online learning with reduced computational resources.
Supports adaptation to system changes and real environments.
Demonstrated applicability in reinforcement learning scenarios.
Abstract
This paper proposes an online learning method of Gaussian process state-space model (GP-SSM). GP-SSM is a probabilistic representation learning scheme that represents unknown state transition and/or measurement models as Gaussian processes (GPs). While the majority of prior literature on learning of GP-SSM are focused on processing a given set of time series data, data may arrive and accumulate sequentially over time in most dynamical systems. Storing all such sequential data and updating the model over entire data incur large amount of computational resources in space and time. To overcome this difficulty, we propose a practical method, termed \textit{onlineGPSSM}, that incorporates stochastic variational inference (VI) and online VI with novel formulation. The proposed method mitigates the computational complexity without catastrophic forgetting and also support adaptation to changes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Fault Detection and Control Systems · Control Systems and Identification
