BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization

Junyi Wang; Yuanyang Zhu; Zhi Wang; Yan Zheng; Jianye Hao; Chunlin Chen

arXiv:2308.01207·cs.NE·August 3, 2023

TL;DR

BiERL introduces a bilevel optimization framework for meta evolutionary reinforcement learning (ERL) that updates hyperparameters jointly with model training, improving exploration and performance across diverse ERL algorithms without prior domain knowledge.

Contribution

The paper proposes a bilevel optimization-based meta ERL framework that updates hyperparameters in parallel with training the ERL model within a single agent, enhancing learning efficiency and robustness.

Findings

01. BiERL outperforms various baselines in MuJoCo and Box2D tasks.

02. It consistently improves the learning performance of different ERL algorithms.

03. The framework reduces the need for prior domain knowledge and costly hyperparameter tuning.

Abstract

Evolutionary reinforcement learning (ERL) algorithms have recently attracted attention for tackling complex reinforcement learning (RL) problems due to their high parallelism, but they are prone to insufficient exploration or model collapse unless their hyperparameters (aka meta-parameters) are carefully tuned. In this paper, we propose a general meta ERL framework via bilevel optimization (BiERL) that jointly updates hyperparameters in parallel with training the ERL model within a single agent, which relieves the need for prior domain knowledge or a costly optimization procedure before model deployment. We design an elegant meta-level architecture that embeds the inner-level's evolving experience into an informative population representation and introduce a simple and feasible evaluation of the meta-level fitness function to facilitate learning efficiency. We perform extensive experiments in MuJoCo and Box2D tasks…
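The bilevel idea in the abstract (an inner level that trains the model by evolution, and an outer level that adjusts hyperparameters while training runs) can be illustrated with a toy sketch. This is not the authors' method: BiERL's meta-level is described as a learned architecture over a population representation, whereas the sketch below swaps in a simple rule that tries three candidate noise scales per iteration and keeps the best one. The objective `fitness`, the helper names `es_step` and `bilevel_es`, and all constants are assumptions made for illustration only.

```python
import random


def fitness(theta):
    # Toy objective (assumption): maximize the negative squared norm.
    return -sum(x * x for x in theta)


def es_step(theta, sigma, rng, pop_size=20, lr=0.05):
    """Inner level: one evolution-strategy update of theta with noise scale sigma."""
    noises = [[rng.gauss(0.0, 1.0) for _ in theta] for _ in range(pop_size)]
    scores = [
        fitness([t + sigma * n for t, n in zip(theta, noise)]) for noise in noises
    ]
    mean = sum(scores) / pop_size
    # Gradient estimate: noise directions weighted by mean-centered fitness.
    grad = [
        sum((s - mean) * noise[i] for s, noise in zip(scores, noises))
        / (pop_size * sigma)
        for i in range(len(theta))
    ]
    return [t + lr * g for t, g in zip(theta, grad)]


def bilevel_es(dim=5, iters=200, seed=0):
    """Outer level: adapt sigma during training instead of tuning it beforehand."""
    rng = random.Random(seed)
    theta = [rng.uniform(-2.0, 2.0) for _ in range(dim)]
    sigma = 0.5
    history = [fitness(theta)]
    for _ in range(iters):
        # Hypothetical meta-level rule: shrink, keep, or grow sigma (clamped),
        # scoring each candidate by the fitness reached after one inner step.
        candidates = [max(1e-3, sigma * 0.8), sigma, min(2.0, sigma * 1.25)]
        results = [(es_step(theta, s, rng), s) for s in candidates]
        theta, sigma = max(results, key=lambda r: fitness(r[0]))
        history.append(fitness(theta))
    return theta, sigma, history


if __name__ == "__main__":
    theta, sigma, history = bilevel_es()
    print("initial fitness:", history[0])
    print("final fitness:", history[-1])
    print("final sigma:", sigma)
```

Scoring each candidate by the fitness after a single inner step is only a stand-in for a meta-level fitness function; the paper's own evaluation is not reproduced here.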

Code & Models

Repositories

chriswang98sz/bierl
PyTorch · Official

Taxonomy

Topics: Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Metaheuristic Optimization Algorithms Research