Model-Based Reinforcement Learning via Stochastic Hybrid Models
Hany Abdulsamad, Jan Peters

TL;DR
This paper presents a hybrid-system approach to model-based reinforcement learning, decomposing nonlinear dynamics into stochastic hybrid models and deriving local controllers through hierarchical optimization, improving interpretability and control of complex systems.
Contribution
It introduces a hierarchical hybrid modeling framework with an EM algorithm for nonlinear dynamics decomposition and a novel Hb-REPS method for policy optimization.
Findings
Decomposes nonlinear dynamics into stochastic hybrid models.
Extracts local polynomial feedback controllers from nonlinear experts.
Optimizes piecewise controllers using hybrid relative entropy policy search.
Abstract
Optimal control of general nonlinear systems is a central challenge in automation. Enabled by powerful function approximators, data-driven approaches to control have recently successfully tackled challenging applications. However, such methods often obscure the structure of dynamics and control behind black-box over-parameterized representations, thus limiting our ability to understand closed-loop behavior. This paper adopts a hybrid-system view of nonlinear modeling and control that lends an explicit hierarchical structure to the problem and breaks down complex dynamics into simpler localized units. We consider a sequence modeling paradigm that captures the temporal structure of the data and derive an expectation-maximization (EM) algorithm that automatically decomposes nonlinear dynamics into stochastic piecewise affine models with nonlinear transition boundaries. Furthermore, we show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Fault Detection and Control Systems
