Policy Optimization over Submanifolds for Linearly Constrained Feedback Synthesis
Shahriar Talebi, Mehran Mesbahi

TL;DR
This paper introduces a Riemannian manifold-based approach for linearly constrained policy optimization in control systems, providing a Newton-type algorithm with convergence guarantees and demonstrating its effectiveness through numerical examples.
Contribution
It develops a novel geometric framework for constrained policy optimization on the manifold of Schur stabilizing controllers, including a new Newton-type algorithm with convergence guarantees.
Findings
The proposed algorithm converges locally without exponential mapping.
Numerical examples demonstrate improved performance over existing methods.
The framework unifies various constrained control problems under a geometric perspective.
Abstract
In this paper, we study linearly constrained policy optimization over the manifold of Schur stabilizing controllers, equipped with a Riemannian metric that emerges naturally in the context of optimal control problems. We provide extrinsic analysis of a generic constrained smooth cost function, that subsequently facilitates subsuming any such constrained problem into this framework. By studying the second order geometry of this manifold, we provide a Newton-type algorithm that does not rely on the exponential mapping nor a retraction, while ensuring local convergence guarantees. The algorithm hinges instead upon the developed stability certificate and the linear structure of the constraints. We then apply our methodology to two well-known constrained optimal control problems. Finally, several numerical examples showcase the performance of the proposed algorithm.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Stability and Control of Uncertain Systems
