An Actor-Critic Method for Simulation-Based Optimization

Kuo Li; Qing-Shan Jia; Jiaqi Yan

arXiv:2111.00435·cs.LG·November 2, 2021

TL;DR

This paper applies an Actor-Critic reinforcement learning framework to simulation-based optimization, treating the choice of which designs to sample from a black-box simulator as a policy to be learned.

Contribution

It formulates simulation-based optimization as a policy search problem and proposes two algorithms tailored for continuous and discrete feasible spaces.
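To make the discrete case concrete, here is a minimal sketch of what a policy search over a finite set of designs could look like. It is not the paper's algorithm: the softmax actor, the running-mean critic, the update rule, the noise level, and the name `run_discrete_actor_critic` are all assumptions chosen for illustration.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = np.exp(logits - logits.max())
    return z / z.sum()

def run_discrete_actor_critic(true_means, n_iters=2000, lr=0.1, seed=0):
    """Illustrative only: search over a finite set of designs.

    true_means is a hypothetical stand-in for the black-box simulator;
    the optimizer only ever sees noisy samples drawn from it.
    """
    rng = np.random.default_rng(seed)
    true_means = np.asarray(true_means, dtype=float)
    k = len(true_means)
    logits = np.zeros(k)   # actor: softmax sampling policy over designs
    q = np.zeros(k)        # critic: estimated performance of each design
    counts = np.zeros(k)
    for _ in range(n_iters):
        probs = softmax(logits)
        a = rng.choice(k, p=probs)                       # actor picks a design
        y = true_means[a] + 0.5 * rng.standard_normal()  # noisy simulator query
        counts[a] += 1
        q[a] += (y - q[a]) / counts[a]                   # critic: running mean
        # Actor: ascend the critic's expected value under the softmax policy.
        logits += lr * probs * (q - probs @ q)
    return softmax(logits), q
```

Calling `run_discrete_actor_critic([0.0, 0.5, 1.0, 0.2])` returns the final sampling probabilities and the critic's per-design estimates; under these assumptions the policy should concentrate on the design with the highest mean performance.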

Findings

01. Algorithms effectively solve toy and complex tasks

02. Demonstrates success in adversarial attack and RL tasks

03. Offers a new perspective on robot control through policy optimization

Abstract

We focus on a simulation-based optimization problem of choosing the best design from the feasible space. Although the simulation model can be queried with finite samples, its internal processing rule cannot be utilized in the optimization process. We formulate the sampling process as a policy searching problem and give a solution from the perspective of Reinforcement Learning (RL). Concretely, the Actor-Critic (AC) framework is applied, where the Critic serves as a surrogate model to predict the performance on unknown designs, whereas the Actor encodes the sampling policy to be optimized. We design the updating rule and propose two algorithms for the cases where the feasible spaces are continuous and discrete, respectively. Some experiments are designed to validate the effectiveness of the proposed algorithms, including two toy examples, which intuitively explain the algorithms, and two more…
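The division of labor described in the abstract can be sketched for a one-dimensional continuous design space. This is a hedged illustration, not the paper's updating rule: the toy simulator `simulate`, the quadratic least-squares critic, the fixed-variance Gaussian actor, and the score-function update with a critic baseline are all assumptions made for the example.

```python
import numpy as np

def simulate(x, rng):
    """Hypothetical black-box simulator: noisy performance, best near x = 2."""
    return -(x - 2.0) ** 2 + 0.1 * rng.standard_normal()

def features(x):
    """Quadratic features [1, x, x^2] for the surrogate model (an assumption)."""
    return np.stack([np.ones_like(x), x, x ** 2], axis=-1)

def run_actor_critic(n_iters=300, batch=16, lr=0.05, sigma=0.5, seed=0):
    rng = np.random.default_rng(seed)
    mu = -1.0        # actor: mean of a Gaussian sampling policy
    w = np.zeros(3)  # critic: weights of the quadratic surrogate
    xs, ys = [], []
    for _ in range(n_iters):
        # Actor proposes designs; the simulator is queried only through samples.
        x = mu + sigma * rng.standard_normal(batch)
        y = np.array([simulate(xi, rng) for xi in x])
        xs.extend(x)
        ys.extend(y)
        # Critic: least-squares fit of the surrogate to all samples so far.
        w, *_ = np.linalg.lstsq(features(np.array(xs)), np.array(ys), rcond=None)
        # Actor: score-function gradient, with the critic's prediction at mu
        # used as a baseline to reduce variance.
        baseline = (features(np.array([mu])) @ w)[0]
        grad = np.mean((y - baseline) * (x - mu) / sigma ** 2)
        mu += lr * grad
    return mu, w
```

Running `run_actor_critic()` returns the final policy mean and the fitted surrogate weights. In this toy setting the mean should drift toward the simulator's optimum, while the critic never sees the simulator's internal rule, only the sampled pairs.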

Taxonomy

Topics: Reinforcement Learning in Robotics · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning