Logarithmic Regret for Nonlinear Control

James Wang; Bruce D. Lee; Ingvar Ziemann; Nikolai Matni

arXiv:2501.10261·cs.LG·April 14, 2025

Logarithmic Regret for Nonlinear Control

James Wang, Bruce D. Lee, Ingvar Ziemann, Nikolai Matni

PDF

Open Access

TL;DR

This paper establishes logarithmic regret bounds for controlling unknown nonlinear dynamical systems, demonstrating that fast learning is possible under certain conditions, with implications for high-stakes applications.

Contribution

It provides the first regret bounds for nonlinear systems with nonlinear parameter dependence and shows conditions for achieving logarithmic regret in control tasks.

Findings

01

Logarithmic regret achievable with persistently exciting policies.

02

Regret bounds grow with the square root of interactions without persistent excitation.

03

Validation of theoretical results through simulation on a simple system.

Abstract

We address the problem of learning to control an unknown nonlinear dynamical system through sequential interactions. Motivated by high-stakes applications in which mistakes can be catastrophic, such as robotics and healthcare, we study situations where it is possible for fast sequential learning to occur. Fast sequential learning is characterized by the ability of the learning agent to incur logarithmic regret relative to a fully-informed baseline. We demonstrate that fast sequential learning is achievable in a diverse class of continuous control problems where the system dynamics depend smoothly on unknown parameters, provided the optimal control policy is persistently exciting. Additionally, we derive a regret bound which grows with the square root of the number of interactions for cases where the optimal policy is not persistently exciting. Our results provide the first regret bounds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Control Systems and Identification