Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network
Bingdong Li, Mei Jiang, Hong Qian, Ke Tang, Aimin Zhou, Peng Yang

TL;DR
This paper introduces a surrogate-assisted evolutionary reinforcement learning method that uses autoencoders and hyperbolic neural networks to efficiently optimize high-dimensional policies, significantly reducing computational costs and improving search effectiveness.
Contribution
It presents the first learnable policy embedding and surrogate modeling modules specifically designed for high-dimensional ERL policies, enhancing efficiency and exploration.
Findings
Outperforms previous methods on Atari and Mujoco benchmarks
Guided search trajectories show improved exploration and convergence
Surrogate-assisted approach reduces evaluation costs significantly
Abstract
Evolutionary Reinforcement Learning (ERL), training the Reinforcement Learning (RL) policies with Evolutionary Algorithms (EAs), have demonstrated enhanced exploration capabilities and greater robustness than using traditional policy gradient. However, ERL suffers from the high computational costs and low search efficiency, as EAs require evaluating numerous candidate policies with expensive simulations, many of which are ineffective and do not contribute meaningfully to the training. One intuitive way to reduce the ineffective evaluations is to adopt the surrogates. Unfortunately, existing ERL policies are often modeled as deep neural networks (DNNs) and thus naturally represented as high-dimensional vectors containing millions of weights, which makes the building of effective surrogates for ERL policies extremely challenging. This paper proposes a novel surrogate-assisted ERL that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAutoencoders · ADaptive gradient method with the OPTimal convergence rate
