Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

Bingdong Li; Mei Jiang; Hong Qian; Ke Tang; Aimin Zhou; Peng Yang

arXiv:2505.19423·cs.LG·May 30, 2025

Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

Bingdong Li, Mei Jiang, Hong Qian, Ke Tang, Aimin Zhou, Peng Yang

PDF

TL;DR

This paper introduces a surrogate-assisted evolutionary reinforcement learning method that uses autoencoders and hyperbolic neural networks to efficiently optimize high-dimensional policies, significantly reducing computational costs and improving search effectiveness.

Contribution

It presents the first learnable policy embedding and surrogate modeling modules specifically designed for high-dimensional ERL policies, enhancing efficiency and exploration.

Findings

01

Outperforms previous methods on Atari and Mujoco benchmarks

02

Guided search trajectories show improved exploration and convergence

03

Surrogate-assisted approach reduces evaluation costs significantly

Abstract

Evolutionary Reinforcement Learning (ERL), training the Reinforcement Learning (RL) policies with Evolutionary Algorithms (EAs), have demonstrated enhanced exploration capabilities and greater robustness than using traditional policy gradient. However, ERL suffers from the high computational costs and low search efficiency, as EAs require evaluating numerous candidate policies with expensive simulations, many of which are ineffective and do not contribute meaningfully to the training. One intuitive way to reduce the ineffective evaluations is to adopt the surrogates. Unfortunately, existing ERL policies are often modeled as deep neural networks (DNNs) and thus naturally represented as high-dimensional vectors containing millions of weights, which makes the building of effective surrogates for ERL policies extremely challenging. This paper proposes a novel surrogate-assisted ERL that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAutoencoders · ADaptive gradient method with the OPTimal convergence rate