Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes
Nick Zhang, Abhishek Gupta, Zefeng Chen, and Yew-Soon Ong

TL;DR
This paper introduces NuEMT, a neuroevolutionary multitasking algorithm that improves reinforcement learning efficiency by transferring skills from short to long episodes using importance sampling and adaptive resource allocation.
Contribution
The paper presents the first multitask skills transfer mechanism for neuroevolution in RL, addressing high sample complexity in long-horizon tasks.
Findings
NuEMT reduces data requirements in RL tasks.
It outperforms recent ES baselines in continuous control.
Uses importance sampling for skill transfer.
Abstract
Studies have shown evolution strategies (ES) to be a promising approach for reinforcement learning (RL) with deep neural networks. However, the issue of high sample complexity persists in applications of ES to deep RL over long horizons. This paper is the first to address the shortcoming of today's methods via a novel neuroevolutionary multitasking (NuEMT) algorithm, designed to transfer information from a set of auxiliary tasks (of short episode length) to the target (full length) RL task at hand. The auxiliary tasks, extracted from the target, allow an agent to update and quickly evaluate policies on shorter time horizons. The evolved skills are then transferred to guide the longer and harder task towards an optimal policy. We demonstrate that the NuEMT algorithm achieves data-efficient evolutionary RL, reducing expensive agent-environment interaction data requirements. Our key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Advanced Multi-Objective Optimization Algorithms
