Model-Free $\mu$ Synthesis via Adversarial Reinforcement Learning
Darioush Keivan, Aaron Havens, Peter Seiler, Geir Dullerud, Bin Hu

TL;DR
This paper introduces a novel model-free adversarial reinforcement learning approach for $mbda$ synthesis, connecting robust control with RL, and demonstrates its effectiveness through extensive numerical experiments.
Contribution
It develops a model-free $mbda$ synthesis algorithm by integrating adversarial RL with classical control techniques, bridging robust control and reinforcement learning.
Findings
The proposed algorithm effectively solves $mbda$ synthesis problems.
Numerical results show competitive performance with traditional methods.
The study reveals new links between adversarial RL and robust control.
Abstract
Motivated by the recent empirical success of policy-based reinforcement learning (RL), there has been a research trend studying the performance of policy-based RL methods on standard control benchmark problems. In this paper, we examine the effectiveness of policy-based RL methods on an important robust control problem, namely synthesis. We build a connection between robust adversarial RL and synthesis, and develop a model-free version of the well-known -iteration for solving state-feedback synthesis with static -scaling. In the proposed algorithm, the step mimics the classical central path algorithm via incorporating a recently-developed double-loop adversarial RL method as a subroutine, and the step is based on model-free finite difference approximation. Extensive numerical study is also presented to demonstrate the utility of our proposed model-free…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvancements in Semiconductor Devices and Circuit Design · Model Reduction and Neural Networks · Probabilistic and Robust Engineering Design
