Optimal Agnostic Control of Unknown Linear Dynamics in a Bounded Parameter Range
Jacob Carruth, Maximilian F. Eggl, Charles Fefferman, Clarence W., Rowley

TL;DR
This paper explores optimal control strategies for unknown linear dynamics with bounded parameters, comparing Bayesian and agnostic approaches, and introduces methods to minimize expected cost and regret under uncertainty.
Contribution
It develops a framework for solving optimal control problems with unknown parameters by reducing agnostic control to Bayesian control using finite priors.
Findings
Optimal strategies derived via Hamilton-Jacobi-Bellman PDEs
Reduction of agnostic control to Bayesian control with finite priors
Strategies minimize expected cost and regret under parameter uncertainty
Abstract
Here and in a follow-on paper, we consider a simple control problem in which the underlying dynamics depend on a parameter that is unknown and must be learned. In this paper, we assume that is bounded, i.e., that , and we study two variants of the control problem. In the first variant, Bayesian control, we are given a prior probability distribution for and we seek a strategy that minimizes the expected value of a given cost function. Assuming that we can solve a certain PDE (the Hamilton-Jacobi-Bellman equation), we produce optimal strategies for Bayesian control. In the second variant, agnostic control, we assume nothing about and we seek a strategy that minimizes a quantity called the regret. We produce a prior probability distribution supported on a finite subset of so that the agnostic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks
