Optimal Agnostic Control of Unknown Linear Dynamics in a Bounded   Parameter Range

Jacob Carruth; Maximilian F. Eggl; Charles Fefferman; Clarence W.; Rowley

arXiv:2309.10138·math.OC·September 20, 2023

Optimal Agnostic Control of Unknown Linear Dynamics in a Bounded Parameter Range

Jacob Carruth, Maximilian F. Eggl, Charles Fefferman, Clarence W., Rowley

PDF

Open Access

TL;DR

This paper explores optimal control strategies for unknown linear dynamics with bounded parameters, comparing Bayesian and agnostic approaches, and introduces methods to minimize expected cost and regret under uncertainty.

Contribution

It develops a framework for solving optimal control problems with unknown parameters by reducing agnostic control to Bayesian control using finite priors.

Findings

01

Optimal strategies derived via Hamilton-Jacobi-Bellman PDEs

02

Reduction of agnostic control to Bayesian control with finite priors

03

Strategies minimize expected cost and regret under parameter uncertainty

Abstract

Here and in a follow-on paper, we consider a simple control problem in which the underlying dynamics depend on a parameter $a$ that is unknown and must be learned. In this paper, we assume that $a$ is bounded, i.e., that $∣ a ∣ \leq a_{MAX}$ , and we study two variants of the control problem. In the first variant, Bayesian control, we are given a prior probability distribution for $a$ and we seek a strategy that minimizes the expected value of a given cost function. Assuming that we can solve a certain PDE (the Hamilton-Jacobi-Bellman equation), we produce optimal strategies for Bayesian control. In the second variant, agnostic control, we assume nothing about $a$ and we seek a strategy that minimizes a quantity called the regret. We produce a prior probability distribution $d Prior (a)$ supported on a finite subset of $[- a_{MAX}, a_{MAX}]$ so that the agnostic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks