Effects of Different Optimization Formulations in Evolutionary   Reinforcement Learning on Diverse Behavior Generation

Victor Villin; Naoki Masuyama; Yusuke Nojima

arXiv:2110.08122·cs.NE·January 28, 2022

Effects of Different Optimization Formulations in Evolutionary Reinforcement Learning on Diverse Behavior Generation

Victor Villin, Naoki Masuyama, Yusuke Nojima

PDF

TL;DR

This paper investigates how different optimization formulations in evolutionary reinforcement learning affect the diversity and effectiveness of generated behaviors, highlighting the importance of balanced multi-objective optimization.

Contribution

It analyzes the impact of reward signal modulation and evolutionary mechanisms on policy diversity within an existing framework, emphasizing the need for balanced objectives.

Findings

01

Unequal objective consideration reduces behavioral diversity.

02

Balanced multi-objective optimization improves policy effectiveness.

03

Unbalanced formulations can worsen task performance.

Abstract

Generating various strategies for a given task is challenging. However, it has already proven to bring many assets to the main learning process, such as improved behavior exploration. With the growth in the interest of heterogeneity in solution in evolutionary computation and reinforcement learning, many promising approaches have emerged. To better understand how one guides multiple policies toward distinct strategies and benefit from diversity, we need to analyze further the influence of the reward signal modulation and other evolutionary mechanisms on the obtained behaviors. To that effect, this paper considers an existing evolutionary reinforcement learning framework which exploits multi-objective optimization as a way to obtain policies that succeed at behavior-related tasks as well as completing the main goal. Experiments on the Atari games stress that optimization formulations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.