Dissipative Imitation Learning for Discrete Dynamic Output Feedback   Control with Sparse Data Sets

Amy K. Strong; Ethan J. LoCicero; Leila J. Bridgeman

arXiv:2309.06658·eess.SY·September 14, 2023

Dissipative Imitation Learning for Discrete Dynamic Output Feedback Control with Sparse Data Sets

Amy K. Strong, Ethan J. LoCicero, Leila J. Bridgeman

PDF

Open Access

TL;DR

This paper introduces a dissipative imitation learning approach for discrete dynamic output feedback control that guarantees stability with sparse data and minimal plant model knowledge, using iterative convex overbounding and projected gradient descent.

Contribution

It proposes a novel IO stability-based dissipative imitation learning method that ensures closed-loop stability with limited data and plant model information.

Findings

01

Achieves closed-loop stability with sparse data sets.

02

Successfully mimics expert controller behavior.

03

Outperforms traditional methods in stability and performance.

Abstract

Imitation learning enables the synthesis of controllers for complex objectives and highly uncertain plant models. However, methods to provide stability guarantees to imitation learned controllers often rely on large amounts of data and/or known plant models. In this paper, we explore an input-output (IO) stability approach to dissipative imitation learning, which achieves stability with sparse data sets and with little known about the plant model. A closed-loop stable dynamic output feedback controller is learned using expert data, a coarse IO plant model, and a new constraint to enforce dissipativity on the learned controller. While the learning objective is nonconvex, iterative convex overbounding (ICO) and projected gradient descent (PGD) are explored as methods to successfully learn the controller. This new imitation learning method is applied to two unknown plants and compared to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Advanced MRI Techniques and Applications

Methodsfail