Convergence of Gradient-based MAML in LQR

Negin Musavi; Geir E. Dullerud

arXiv:2309.06588·eess.SY·September 18, 2023

Convergence of Gradient-based MAML in LQR

Negin Musavi, Geir E. Dullerud

PDF

Open Access

TL;DR

This paper analyzes the local convergence properties of gradient-based MAML in linear quadratic control, providing theoretical guarantees and demonstrating stability and convergence through numerical experiments.

Contribution

It offers the first local convergence guarantees for MAML in LQR, ensuring stability in dynamic systems, which was previously unestablished.

Findings

01

MAML converges locally in LQR tasks under certain conditions.

02

Theoretical guarantees ensure stability of the control system.

03

Numerical results confirm convergence behavior.

Abstract

The main objective of this research paper is to investigate the local convergence characteristics of Model-agnostic Meta-learning (MAML) when applied to linear system quadratic optimal control (LQR). MAML and its variations have become popular techniques for quickly adapting to new tasks by leveraging previous learning knowledge in areas like regression, classification, and reinforcement learning. However, its theoretical guarantees remain unknown due to non-convexity and its structure, making it even more challenging to ensure stability in the dynamic system setting. This study focuses on exploring MAML in the LQR setting, providing its local convergence guarantees while maintaining the stability of the dynamical system. The paper also presents simple numerical results to demonstrate the convergence properties of MAML in LQR tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Control Systems and Identification · Model Reduction and Neural Networks

MethodsModel-Agnostic Meta-Learning