Model-Based Reinforcement Learning via Stochastic Hybrid Models

Hany Abdulsamad; Jan Peters

arXiv:2111.06211·eess.SY·June 21, 2023

Model-Based Reinforcement Learning via Stochastic Hybrid Models

Hany Abdulsamad, Jan Peters

PDF

Open Access

TL;DR

This paper presents a hybrid-system approach to model-based reinforcement learning, decomposing nonlinear dynamics into stochastic hybrid models and deriving local controllers through hierarchical optimization, improving interpretability and control of complex systems.

Contribution

It introduces a hierarchical hybrid modeling framework with an EM algorithm for nonlinear dynamics decomposition and a novel Hb-REPS method for policy optimization.

Findings

01

Decomposes nonlinear dynamics into stochastic hybrid models.

02

Extracts local polynomial feedback controllers from nonlinear experts.

03

Optimizes piecewise controllers using hybrid relative entropy policy search.

Abstract

Optimal control of general nonlinear systems is a central challenge in automation. Enabled by powerful function approximators, data-driven approaches to control have recently successfully tackled challenging applications. However, such methods often obscure the structure of dynamics and control behind black-box over-parameterized representations, thus limiting our ability to understand closed-loop behavior. This paper adopts a hybrid-system view of nonlinear modeling and control that lends an explicit hierarchical structure to the problem and breaks down complex dynamics into simpler localized units. We consider a sequence modeling paradigm that captures the temporal structure of the data and derive an expectation-maximization (EM) algorithm that automatically decomposes nonlinear dynamics into stochastic piecewise affine models with nonlinear transition boundaries. Furthermore, we show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Fault Detection and Control Systems