Follow the Clairvoyant: an Imitation Learning Approach to Optimal   Control

Andrea Martin; Luca Furieri; Florian D\"orfler; John Lygeros,; Giancarlo Ferrari-Trecate

arXiv:2211.07389·eess.SY·November 15, 2022

Follow the Clairvoyant: an Imitation Learning Approach to Optimal Control

Andrea Martin, Luca Furieri, Florian D\"orfler, John Lygeros,, Giancarlo Ferrari-Trecate

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel imitation learning approach for optimal control that directly minimizes tracking error relative to the clairvoyant policy, leading to improved performance over regret minimization methods, especially under constraints.

Contribution

It proposes an efficient optimization-based method for follow-the-clairvoyant control, bridging classical control laws and outperforming regret minimization in constrained scenarios.

Findings

01

The approach attains minimal regret without constraints.

02

Numerical experiments show superior tracking of clairvoyant trajectories.

03

Method interpolates between H2 and H∞ control laws.

Abstract

We consider control of dynamical systems through the lens of competitive analysis. Most prior work in this area focuses on minimizing regret, that is, the loss relative to an ideal clairvoyant policy that has noncausal access to past, present, and future disturbances. Motivated by the observation that the optimal cost only provides coarse information about the ideal closed-loop behavior, we instead propose directly minimizing the tracking error relative to the optimal trajectories in hindsight, i.e., imitating the clairvoyant policy. By embracing a system level perspective, we present an efficient optimization-based approach for computing follow-the-clairvoyant (FTC) safe controllers. We prove that these attain minimal regret if no constraints are imposed on the noncausal benchmark. In addition, we present numerical experiments to show that our policy retains the hallmark of competitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

decodepfl/followtheclairvoyant
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Receptor Mechanisms and Signaling