Technical Report: Adaptive Control for Linearizable Systems Using   On-Policy Reinforcement Learning

Tyler Westenbroek; Eric Mazumdar; David Fridovich-Keil; Valmik Prabhu,; Claire J. Tomlin; S. Shankar Sastry

arXiv:2004.02766·cs.LG·April 7, 2020·1 cites

Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu,, Claire J. Tomlin, S. Shankar Sastry

PDF

Open Access

TL;DR

This paper introduces a model-free reinforcement learning framework for adaptive control of unknown linearizable systems, enabling stable tracking without requiring invertible inverse models, and demonstrates its effectiveness through simulation.

Contribution

It presents a novel reinforcement learning approach for adaptive control that relaxes invertibility constraints and provides probabilistic stability guarantees.

Findings

01

High-probability convergence of tracking errors

02

Applicability to systems with non-invertible models

03

Successful simulation on a double pendulum

Abstract

This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enables the use of general function approximators to approximate the linearizing controller for the system without having to worry about singularities. However, the discrete-time and stochastic nature of these algorithms precludes the direct application of standard machinery from the adaptive control literature to provide deterministic stability proofs for the system. Nevertheless, we leverage these techniques alongside tools from the stochastic approximation literature to demonstrate that with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Advanced Control Systems Optimization