Data-driven Rollout for Deterministic Optimal Control

Yuchao Li; Karl H. Johansson; Jonas Mårtensson; Dimitri P. Bertsekas

arXiv:2105.03116·math.OC·September 30, 2021

Open Access

TL;DR

This paper introduces a rollout algorithm for deterministic infinite horizon optimal control problems that relies on sampled data generated by a base policy, and that extends to problems with trajectory constraints and to problems with a multiagent structure.

Contribution

The paper proposes a rollout method, based on value and policy iteration ideas, that applies to deterministic problems with arbitrary state and control spaces and arbitrary dynamics.

Findings

01 The algorithm relies on sampled data generated by a base policy.

02 It extends to problems with trajectory constraints and to problems with a multiagent structure.

03 It applies to deterministic problems with arbitrary state and control spaces and arbitrary dynamics.

Abstract

We consider deterministic infinite horizon optimal control problems with nonnegative stage costs. We draw inspiration from the learning model predictive control scheme designed for continuous dynamics and iterative tasks, and propose a rollout algorithm that relies on sampled data generated by some base policy. The proposed algorithm is based on value and policy iteration ideas, and applies to deterministic problems with arbitrary state and control spaces, and arbitrary dynamics. It admits extensions to problems with trajectory constraints, and to problems with a multiagent structure.
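To make the idea in the abstract concrete, the following is a minimal Python sketch of one way a rollout step could use data sampled from a base policy. It is an illustration under stated assumptions, not the paper's algorithm: the toy problem (integer states, a cost-free target at 0, unit stage costs), the one-step lookahead, and all names (`sample_trajectory_costs`, `rollout_control`, `f`, `g`, `controls`, `base_policy`) are hypothetical. The base policy's recorded cost-to-go serves as the terminal cost approximation, and lookahead is restricted to controls whose successor state appears in the sampled data.

```python
from typing import Callable, Dict, Hashable, Iterable, Optional, Tuple

State = Hashable
Control = Hashable


def sample_trajectory_costs(x0, base_policy, f, g, is_terminal,
                            max_steps: int = 1000) -> Dict[State, float]:
    """Run the base policy from x0 and record its cost-to-go at each visited state."""
    states, stage_costs = [x0], []
    while not is_terminal(states[-1]):
        if len(stage_costs) >= max_steps:
            raise RuntimeError("base policy did not reach the terminal set")
        x = states[-1]
        u = base_policy(x)
        stage_costs.append(g(x, u))
        states.append(f(x, u))
    # Accumulate stage costs backward to get the cost-to-go along the trajectory.
    cost_to_go = {states[-1]: 0.0}
    tail = 0.0
    for x, c in zip(reversed(states[:-1]), reversed(stage_costs)):
        tail += c
        cost_to_go[x] = tail
    return cost_to_go


def rollout_control(x, controls, f, g,
                    cost_to_go: Dict[State, float]) -> Tuple[Optional[Control], float]:
    """One-step lookahead restricted to successors that appear in the sampled data."""
    best_u, best_q = None, float("inf")
    for u in controls(x):
        nxt = f(x, u)
        if nxt in cost_to_go:  # only sampled states have a known cost estimate
            q = g(x, u) + cost_to_go[nxt]
            if q < best_q:
                best_u, best_q = u, q
    return best_u, best_q


# Hypothetical toy problem: move left on the integers until reaching 0.
def f(x: int, u: int) -> int:          # deterministic dynamics
    return max(x - u, 0)

def g(x: int, u: int) -> float:        # nonnegative stage cost
    return 1.0

def controls(x: int) -> Iterable[int]:  # step left by 1 or by 2
    return (1, 2)

def base_policy(x: int) -> int:        # assumed base policy: always step by 1
    return 1

def is_terminal(x: int) -> bool:
    return x == 0


data = sample_trajectory_costs(5, base_policy, f, g, is_terminal)

x, rollout_cost = 5, 0.0
while not is_terminal(x):
    u, _ = rollout_control(x, controls, f, g, data)
    rollout_cost += g(x, u)
    x = f(x, u)

print(f"base cost {data[5]}, rollout cost {rollout_cost}")
```

In this toy instance the base policy reaches the target from state 5 at cost 5.0, while the rollout policy built from its sampled trajectory reaches it at cost 3.0. Restricting the lookahead to sampled states is a design choice of this sketch: it keeps every evaluated cost backed by recorded data rather than by a model of the base policy.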

Taxonomy

Topics: Advanced Control Systems Optimization · Control Systems and Identification · Reinforcement Learning in Robotics