Multitask Neuroevolution for Reinforcement Learning with Long and Short   Episodes

Nick Zhang; Abhishek Gupta; Zefeng Chen; and Yew-Soon Ong

arXiv:2203.10844·cs.NE·November 15, 2022

Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes

Nick Zhang, Abhishek Gupta, Zefeng Chen, and Yew-Soon Ong

PDF

Open Access

TL;DR

This paper introduces NuEMT, a neuroevolutionary multitasking algorithm that improves reinforcement learning efficiency by transferring skills from short to long episodes using importance sampling and adaptive resource allocation.

Contribution

The paper presents the first multitask skills transfer mechanism for neuroevolution in RL, addressing high sample complexity in long-horizon tasks.

Findings

01

NuEMT reduces data requirements in RL tasks.

02

It outperforms recent ES baselines in continuous control.

03

Uses importance sampling for skill transfer.

Abstract

Studies have shown evolution strategies (ES) to be a promising approach for reinforcement learning (RL) with deep neural networks. However, the issue of high sample complexity persists in applications of ES to deep RL over long horizons. This paper is the first to address the shortcoming of today's methods via a novel neuroevolutionary multitasking (NuEMT) algorithm, designed to transfer information from a set of auxiliary tasks (of short episode length) to the target (full length) RL task at hand. The auxiliary tasks, extracted from the target, allow an agent to update and quickly evaluate policies on shorter time horizons. The evolved skills are then transferred to guide the longer and harder task towards an optimal policy. We demonstrate that the NuEMT algorithm achieves data-efficient evolutionary RL, reducing expensive agent-environment interaction data requirements. Our key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Advanced Multi-Objective Optimization Algorithms