Action Redundancy in Reinforcement Learning

Nir Baram; Guy Tennenholtz; Shie Mannor

arXiv:2102.11329·cs.LG·July 27, 2021·1 cites

Action Redundancy in Reinforcement Learning

Nir Baram, Guy Tennenholtz, Shie Mannor

PDF

Open Access

TL;DR

This paper introduces the concept of transition entropy in reinforcement learning, focusing on reducing action redundancy to improve learning efficiency, with algorithms tested on synthetic and benchmark environments.

Contribution

It proposes a novel focus on transition entropy and develops algorithms to minimize action redundancy, addressing a fundamental problem in reinforcement learning.

Findings

01

Action redundancy significantly impacts RL performance.

02

Algorithms effectively reduce redundancy in synthetic and benchmark tests.

03

Transition entropy offers a new perspective beyond action entropy.

Abstract

Maximum Entropy (MaxEnt) reinforcement learning is a powerful learning paradigm which seeks to maximize return under entropy regularization. However, action entropy does not necessarily coincide with state entropy, e.g., when multiple actions produce the same transition. Instead, we propose to maximize the transition entropy, i.e., the entropy of next states. We show that transition entropy can be described by two terms; namely, model-dependent transition entropy and action redundancy. Particularly, we explore the latter in both deterministic and stochastic settings and develop tractable approximation methods in a near model-free setup. We construct algorithms to minimize action redundancy and demonstrate their effectiveness on a synthetic environment with multiple redundant actions as well as contemporary benchmarks in Atari and Mujoco. Our results suggest that action redundancy is a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Neural dynamics and brain function