Compactly Restrictable Metric Policy Optimization Problems

Victor D. Dorobantu; Kamyar Azizzadenesheli; and Yisong Yue

arXiv:2207.05850·math.OC·July 14, 2022

Compactly Restrictable Metric Policy Optimization Problems

Victor D. Dorobantu, Kamyar Azizzadenesheli, and Yisong Yue

PDF

Open Access

TL;DR

This paper introduces a class of metric policy optimization problems called CR-MPOPs, which are designed to be both expressive for robotic systems and solvable via dynamic programming, with theoretical insights into their properties.

Contribution

The paper defines CR-MPOPs, a new class of optimization problems for deterministic MDPs with metric spaces, and establishes their well-posedness and relevance to control systems.

Findings

01

CR-MPOPs can be characterized using forward-invariance.

02

CR-MPOPs admit solutions via value iteration.

03

Theoretical results relate CR-MPOPs to feedback linearizable systems.

Abstract

We study policy optimization problems for deterministic Markov decision processes (MDPs) with metric state and action spaces, which we refer to as Metric Policy Optimization Problems (MPOPs). Our goal is to establish theoretical results on the well-posedness of MPOPs that can characterize practically relevant continuous control systems. To do so, we define a special class of MPOPs called Compactly Restrictable MPOPs (CR-MPOPs), which are flexible enough to capture the complex behavior of robotic systems but specific enough to admit solutions using dynamic programming methods such as value iteration. We show how to arrive at CR-MPOPs using forward-invariance. We further show that our theoretical results on CR-MPOPs can be used to characterize feedback linearizable control affine systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Formal Methods in Verification