Model-Free $\mu$ Synthesis via Adversarial Reinforcement Learning

Darioush Keivan; Aaron Havens; Peter Seiler; Geir Dullerud; Bin Hu

arXiv:2111.15537·cs.LG·June 9, 2022

Model-Free $\mu$ Synthesis via Adversarial Reinforcement Learning

Darioush Keivan, Aaron Havens, Peter Seiler, Geir Dullerud, Bin Hu

PDF

Open Access

TL;DR

This paper introduces a novel model-free adversarial reinforcement learning approach for $mbda$ synthesis, connecting robust control with RL, and demonstrates its effectiveness through extensive numerical experiments.

Contribution

It develops a model-free $mbda$ synthesis algorithm by integrating adversarial RL with classical control techniques, bridging robust control and reinforcement learning.

Findings

01

The proposed algorithm effectively solves $mbda$ synthesis problems.

02

Numerical results show competitive performance with traditional methods.

03

The study reveals new links between adversarial RL and robust control.

Abstract

Motivated by the recent empirical success of policy-based reinforcement learning (RL), there has been a research trend studying the performance of policy-based RL methods on standard control benchmark problems. In this paper, we examine the effectiveness of policy-based RL methods on an important robust control problem, namely $μ$ synthesis. We build a connection between robust adversarial RL and $μ$ synthesis, and develop a model-free version of the well-known $D K$ -iteration for solving state-feedback $μ$ synthesis with static $D$ -scaling. In the proposed algorithm, the $K$ step mimics the classical central path algorithm via incorporating a recently-developed double-loop adversarial RL method as a subroutine, and the $D$ step is based on model-free finite difference approximation. Extensive numerical study is also presented to demonstrate the utility of our proposed model-free…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvancements in Semiconductor Devices and Circuit Design · Model Reduction and Neural Networks · Probabilistic and Robust Engineering Design