Dissipative Imitation Learning for Discrete Dynamic Output Feedback Control with Sparse Data Sets
Amy K. Strong, Ethan J. LoCicero, Leila J. Bridgeman

TL;DR
This paper introduces a dissipative imitation learning approach for discrete dynamic output feedback control that guarantees stability with sparse data and minimal plant model knowledge, using iterative convex overbounding and projected gradient descent.
Contribution
It proposes a novel IO stability-based dissipative imitation learning method that ensures closed-loop stability with limited data and plant model information.
Findings
Achieves closed-loop stability with sparse data sets.
Successfully mimics expert controller behavior.
Outperforms traditional methods in stability and performance.
Abstract
Imitation learning enables the synthesis of controllers for complex objectives and highly uncertain plant models. However, methods to provide stability guarantees to imitation learned controllers often rely on large amounts of data and/or known plant models. In this paper, we explore an input-output (IO) stability approach to dissipative imitation learning, which achieves stability with sparse data sets and with little known about the plant model. A closed-loop stable dynamic output feedback controller is learned using expert data, a coarse IO plant model, and a new constraint to enforce dissipativity on the learned controller. While the learning objective is nonconvex, iterative convex overbounding (ICO) and projected gradient descent (PGD) are explored as methods to successfully learn the controller. This new imitation learning method is applied to two unknown plants and compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Advanced MRI Techniques and Applications
Methodsfail
