Instance Weighted Incremental Evolution Strategies for Reinforcement   Learning in Dynamic Environments

Zhi Wang; Chunlin Chen; Daoyi Dong

arXiv:2010.04605·cs.LG·April 1, 2022

Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments

Zhi Wang, Chunlin Chen, Daoyi Dong

PDF

1 Repo

TL;DR

This paper introduces IW-IES, a scalable evolution strategy that adaptively updates policies in dynamic environments by weighting instances based on novelty and quality, enabling rapid learning adaptation.

Contribution

It proposes a novel incremental learning method for evolution strategies in dynamic environments, incorporating instance weighting to improve adaptation and scalability.

Findings

01

IW-IES outperforms existing methods on RL tasks.

02

The weighting mechanism accelerates adaptation to environment changes.

03

The approach is effective in diverse tasks like navigation and locomotion.

Abstract

Evolution strategies (ES), as a family of black-box optimization algorithms, recently emerge as a scalable alternative to reinforcement learning (RL) approaches such as Q-learning or policy gradient, and are much faster when many central processing units (CPUs) are available due to better parallelization. In this paper, we propose a systematic incremental learning method for ES in dynamic environments. The goal is to adjust previously learned policy to a new one incrementally whenever the environment changes. We incorporate an instance weighting mechanism with ES to facilitate its learning adaptation, while retaining scalability of ES. During parameter updating, higher weights are assigned to instances that contain more new knowledge, thus encouraging the search distribution to move towards new promising areas of parameter space. We propose two easy-to-implement metrics to calculate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HeyuanMingong/iwies
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning