Enabling surrogate-assisted evolutionary reinforcement learning via   policy embedding

Lan Tang; Xiaxi Li; Jinyuan Zhang; Guiying Li; Peng Yang; Ke Tang

arXiv:2301.13374·cs.NE·February 1, 2023

Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding

Lan Tang, Xiaxi Li, Jinyuan Zhang, Guiying Li, Peng Yang, Ke Tang

PDF

Open Access

TL;DR

This paper introduces PE-SAERL, a framework that integrates surrogate models with policy embedding to significantly accelerate evolutionary reinforcement learning, demonstrated by up to 7x faster training on Atari games.

Contribution

It is the first to enable surrogate-assisted ERL through policy embedding, addressing high-dimensional policy challenges and improving training efficiency.

Findings

01

Training speed up to 7x on Atari games

02

Outperforms four state-of-the-art algorithms

03

Effective surrogate integration for high-dimensional policies

Abstract

Evolutionary Reinforcement Learning (ERL) that applying Evolutionary Algorithms (EAs) to optimize the weight parameters of Deep Neural Network (DNN) based policies has been widely regarded as an alternative to traditional reinforcement learning methods. However, the evaluation of the iteratively generated population usually requires a large amount of computational time and can be prohibitively expensive, which may potentially restrict the applicability of ERL. Surrogate is often used to reduce the computational burden of evaluation in EAs. Unfortunately, in ERL, each individual of policy usually represents millions of weights parameters of DNN. This high-dimensional representation of policy has introduced a great challenge to the application of surrogates into ERL to speed up training. This paper proposes a PE-SAERL Framework to at the first time enable surrogate-assisted evolutionary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications · Sports Analytics and Performance · Reinforcement Learning in Robotics

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings