A Convex Optimization Approach to Model-Free Inverse Optimal Control with Provable Convergence
Meiling Yu, Lechen Feng, Lei Jiang, Yuan-Hua Ni

TL;DR
This paper introduces a convex optimization framework for model-free inverse optimal control in continuous-time LQR systems, providing the first convergence guarantees and demonstrating superior speed and accuracy over existing methods.
Contribution
It reformulates the joint system and cost estimation problem into a convex second-order cone program with provable convergence, enabling efficient and reliable IOC solutions.
Findings
Achieves a sublinear convergence rate of O(1/k).
Demonstrates an order-of-magnitude faster convergence than benchmarks.
Shows improved accuracy and robustness in simulations.
Abstract
Inverse Optimal Control (IOC) aims to infer the underlying cost functional of an agent from observations of its expert behavior. This paper focuses on the IOC problem within the continuous-time linear quadratic regulator framework, specifically addressing the challenging scenario where both the system dynamics and the cost functional weighting matrices are unknown. A significant limitation of existing methods for this joint estimation problem is the lack of rigorous theoretical guarantees on the convergence and convergence rate of their optimization algorithms, which restricts their application in safety-critical systems. To bridge this theoretical gap, we propose an analytical framework for IOC that provides such guarantees. The core contribution lies in the equivalent reformulation of this non-convex problem of jointly estimating system and cost parameters into a convex second-order…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Distributed Control Multi-Agent Systems · Stability and Control of Uncertain Systems
