An Actor-Critic Method for Simulation-Based Optimization
Kuo Li, Qing-Shan Jia, Jiaqi Yan

TL;DR
This paper introduces an Actor-Critic reinforcement learning approach for simulation-based optimization, enabling efficient design selection and policy optimization in complex, large-scale problems.
Contribution
It formulates simulation-based optimization as a policy search problem and proposes two algorithms tailored for continuous and discrete feasible spaces.
Findings
Algorithms effectively solve toy and complex tasks
Demonstrates success in adversarial attack and RL tasks
Offers a new perspective on robot control through policy optimization
Abstract
We focus on a simulation-based optimization problem of choosing the best design from the feasible space. Although the simulation model can be queried with finite samples, its internal processing rule cannot be utilized in the optimization process. We formulate the sampling process as a policy searching problem and give a solution from the perspective of Reinforcement Learning (RL). Concretely, Actor-Critic (AC) framework is applied, where the Actor serves as a surrogate model to predict the performance on unknown designs, whereas the actor encodes the sampling policy to be optimized. We design the updating rule and propose two algorithms for the cases where the feasible spaces are continuous and discrete respectively. Some experiments are designed to validate the effectiveness of proposed algorithms, including two toy examples, which intuitively explain the algorithms, and two more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning
