Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning

Chengqian Gao; William de Vazelhes; Hualin Zhang; Bin Gu; Zhiqiang Xu

arXiv:2405.01615 · cs.NE · May 6, 2024

TL;DR

This paper introduces NESHT, a method that combines Hard-Thresholding with Natural Evolution Strategies to filter out task-irrelevant input features in reinforcement learning, especially in noisy, real-world scenarios.

Contribution

The paper proposes NESHT, which augments NES with hard-thresholding to promote sparsity, so that irrelevant features are filtered out in reinforcement learning tasks.
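
As a rough illustration of the idea (not the authors' implementation), the sketch below pairs a standard antithetic NES gradient estimate with a top-k hard-thresholding step after each update. The function names, the toy reward, and all hyperparameters (`k`, `lr`, `sigma`, `n_pairs`) are assumptions made for this example; the paper's exact update rule and settings may differ.

```python
import numpy as np


def hard_threshold(theta, k):
    """Keep the k largest-magnitude entries of theta and zero out the rest."""
    if k <= 0:
        return np.zeros_like(theta)
    if k >= theta.size:
        return theta.copy()
    out = np.zeros_like(theta)
    keep = np.argpartition(np.abs(theta), -k)[-k:]  # indices of the top-k magnitudes
    out[keep] = theta[keep]
    return out


def nes_gradient(f, theta, sigma, n_pairs, rng):
    """Antithetic NES estimate of the gradient of E[f(theta + sigma * eps)]."""
    eps = rng.standard_normal((n_pairs, theta.size))
    f_plus = np.array([f(theta + sigma * e) for e in eps])
    f_minus = np.array([f(theta - sigma * e) for e in eps])
    return ((f_plus - f_minus)[:, None] * eps).mean(axis=0) / (2.0 * sigma)


def nes_with_hard_thresholding(f, dim, k, steps=200, lr=0.05,
                               sigma=0.1, n_pairs=32, seed=0):
    """Gradient ascent on f with a hard-thresholding projection after each step."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(dim)
    for _ in range(steps):
        grad = nes_gradient(f, theta, sigma, n_pairs, rng)
        theta = hard_threshold(theta + lr * grad, k)  # ascent step, then sparsify
    return theta


def toy_reward(theta):
    """Hypothetical reward: only the first two of the parameters matter."""
    target = np.array([1.0, -2.0])
    return -np.sum((theta[:2] - target) ** 2)


theta = nes_with_hard_thresholding(toy_reward, dim=6, k=2)
print("nonzero entries:", np.count_nonzero(theta))
print("theta:", theta)
```

Whatever the optimizer finds, the projection guarantees the returned parameter vector has at most `k` nonzero entries; which coordinates survive depends on the noisy gradient estimates, so this toy run is not a stand-in for the paper's experiments.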

Findings

1. NESHT outperforms standard NES on noisy MuJoCo and Atari tasks.
2. It effectively mitigates the impact of irrelevant features.
3. Empirical results show improved decision-making performance.

Abstract

Evolution Strategies (ES) have emerged as a competitive alternative for model-free reinforcement learning, showcasing exemplary performance in tasks like Mujoco and Atari. Notably, they shine in scenarios with imperfect reward functions, making them invaluable for real-world applications where dense reward signals may be elusive. Yet, an inherent assumption in ES, that all input features are task-relevant, poses challenges, especially when confronted with irrelevant features common in real-world problems. This work scrutinizes this limitation, particularly focusing on the Natural Evolution Strategies (NES) variant. We propose NESHT, a novel approach that integrates Hard-Thresholding (HT) with NES to champion sparsity, ensuring only pertinent features are employed. Backed by rigorous analysis and empirical tests, NESHT demonstrates its promise in mitigating the pitfalls of irrelevant…

Code & Models

Repositories

cangcn/nes-ht (Official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications