Online Optimal Control with Linear Dynamics and Predictions: Algorithms   and Regret Analysis

Yingying Li; Xin Chen; Na Li

arXiv:1906.11378·math.OC·October 23, 2019·NeurIPS·29 cites

Online Optimal Control with Linear Dynamics and Predictions: Algorithms and Regret Analysis

Yingying Li, Xin Chen, Na Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Receding Horizon Gradient-based Control (RHGC) algorithm for online linear control with lookahead predictions, achieving near-optimal regret decay and demonstrating effectiveness through numerical tests.

Contribution

The paper proposes a novel online control algorithm that leverages finite lookahead predictions, with theoretical regret bounds close to fundamental limits, and validates its performance numerically.

Findings

01

Regret decays exponentially with lookahead window size.

02

RHGC nearly attains the fundamental regret limit.

03

Numerical tests confirm effectiveness for linear and nonlinear systems.

Abstract

This paper studies the online optimal control problem with time-varying convex stage costs for a time-invariant linear dynamical system, where a finite lookahead window of accurate predictions of the stage costs are available at each time. We design online algorithms, Receding Horizon Gradient-based Control (RHGC), that utilize the predictions through finite steps of gradient computations. We study the algorithm performance measured by dynamic regret: the online performance minus the optimal performance in hindsight. It is shown that the dynamic regret of RHGC decays exponentially with the size of the lookahead window. In addition, we provide a fundamental limit of the dynamic regret for any online algorithms by considering linear quadratic tracking problems. The regret upper bound of one RHGC method almost reaches the fundamental limit, demonstrating the effectiveness of the algorithm.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

li-yingying/RHGC
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Adaptive Dynamic Programming Control · Age of Information Optimization