Adaptive Variants of Optimal Feedback Policies

Brett T. Lopez; Jean-Jacques E. Slotine

arXiv:2104.02709·eess.SY·April 13, 2022

Adaptive Variants of Optimal Feedback Policies

Brett T. Lopez, Jean-Jacques E. Slotine

PDF

Open Access

TL;DR

This paper introduces a control-theoretic framework for adaptive optimal feedback policies that maintain effectiveness under parameter variations in uncertain nonlinear systems, with applications in transfer learning and sim-to-real transfer.

Contribution

It proposes a systematic method for adaptive control laws that ensure stability and near-optimal performance despite significant parametric uncertainties.

Findings

01

Guarantees convergence to zero cost state

02

Maintains effectiveness in transfer learning scenarios

03

Achieves near-optimal performance on mountain car problem

Abstract

The stable combination of optimal feedback policies with online learning is studied in a new control-theoretic framework for uncertain nonlinear systems. The framework can be systematically used in transfer learning and sim-to-real applications, where an optimal policy learned for a nominal system needs to remain effective in the presence of significant variations in parameters. Given unknown parameters within a bounded range, the resulting adaptive control laws guarantee convergence of the closed-loop system to the state of zero cost. Online adjustment of the learning rate is used as a key stability mechanism, and preserves certainty equivalence when designing optimal policies without assuming uncertainty to be within the control range. The approach is illustrated on the familiar mountain car problem, where it yields near-optimal performance despite the presence of parametric model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Advanced Control Systems Optimization · Control Systems and Identification