Policy Optimization over Submanifolds for Linearly Constrained Feedback   Synthesis

Shahriar Talebi; Mehran Mesbahi

arXiv:2201.11157·math.OC·October 27, 2023

Policy Optimization over Submanifolds for Linearly Constrained Feedback Synthesis

Shahriar Talebi, Mehran Mesbahi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Riemannian manifold-based approach for linearly constrained policy optimization in control systems, providing a Newton-type algorithm with convergence guarantees and demonstrating its effectiveness through numerical examples.

Contribution

It develops a novel geometric framework for constrained policy optimization on the manifold of Schur stabilizing controllers, including a new Newton-type algorithm with convergence guarantees.

Findings

01

The proposed algorithm converges locally without exponential mapping.

02

Numerical examples demonstrate improved performance over existing methods.

03

The framework unifies various constrained control problems under a geometric perspective.

Abstract

In this paper, we study linearly constrained policy optimization over the manifold of Schur stabilizing controllers, equipped with a Riemannian metric that emerges naturally in the context of optimal control problems. We provide extrinsic analysis of a generic constrained smooth cost function, that subsequently facilitates subsuming any such constrained problem into this framework. By studying the second order geometry of this manifold, we provide a Newton-type algorithm that does not rely on the exponential mapping nor a retraction, while ensuring local convergence guarantees. The algorithm hinges instead upon the developed stability certificate and the linear structure of the constraints. We then apply our methodology to two well-known constrained optimal control problems. Finally, several numerical examples showcase the performance of the proposed algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shahriarta/qrnpo
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Stability and Control of Uncertain Systems