Policy Optimization of Mixed H2/H-infinity Control: Benign Nonconvexity and Global Optimality
Chih-Fan Pai, Yuto Watanabe, Yujie Tang, Yang Zheng

TL;DR
This paper reveals that mixed H2/H-infinity control problems have a benign nonconvex structure where all stationary points are globally optimal, enabling scalable policy optimization methods.
Contribution
It introduces a modern policy optimization perspective, characterizes the feasible set, and develops an Extended Convex Lifting framework to reveal hidden convexity and ensure global optimality.
Findings
Every stationary point is globally optimal in both cases.
The feasible set is open, path-connected, with boundary policies saturating the H-infinity constraint.
The mixed objective is real analytic with explicit gradient formulas.
Abstract
Mixed H2/H-infinity control balances performance and robustness by minimizing an H2 cost bound subject to an H-infinity constraint. However, classical Riccati/LMI solutions offer limited insight into the nonconvex optimization landscape and do not readily scale to large-scale or data-driven settings. In this paper, we revisit mixed H2/H-infinity control from a modern policy optimization viewpoint, including the general two-channel and single-channel cases. One central result is that both cases enjoy a benign nonconvex structure: every stationary point is globally optimal. We characterize the H-infinity-constrained feasible set, which is open, path-connected, with boundary given exactly by policies saturating the H-infinity constraint. We also show that the mixed objective is real analytic in the interior with explicit gradient formulas. Our key analysis builds on an Extended Convex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Stability and Control of Uncertain Systems
