On a probabilistic approach to synthesize control policies from example   datasets

Davide Gagliardi; Giovanni Russo

arXiv:2005.11191·math.OC·January 11, 2022·Autom.

On a probabilistic approach to synthesize control policies from example datasets

Davide Gagliardi, Giovanni Russo

PDF

1 Repo

TL;DR

This paper presents a probabilistic method to synthesize control policies from example datasets, ensuring system behavior similarity and constraint compliance, even with noisy data and system differences.

Contribution

It introduces a theoretical framework and an explicit algorithm to derive control policies from example data, addressing actuation constraints and system discrepancies.

Findings

01

Effective control policy synthesis from real-world data

02

Guarantees system behavior similarity and constraint adherence

03

Validated with a highway merging case study

Abstract

This paper is concerned with the design of control policies from example datasets. The case considered is when just a black box description of the system to be controlled is available and the system is affected by actuation constraints. These constraints are not necessarily fulfilled by the (possibly, noisy) example data and the system under control is not necessarily the same as the one from which these data are collected. In this context, we introduce a number of theoretical results to compute a control policy from example datasets that: (i) makes the behavior of the closed-loop system similar to the one illustrated in the data; (ii) guarantees compliance with the constraints. We recast the control problem as a finite-horizon optimal control problem and give an explicit expression for its optimal solution. Moreover, we turn our findings into an algorithmic procedure. The procedure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

unisa-acg/discrete-fpd
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.