Reducing Action Space for Deep Reinforcement Learning via Causal Effect   Estimation

Wenzhang Liu; Lianjun Jin; Lu Ren; Chaoxu Mu; Changyin Sun

arXiv:2501.14543·cs.LG·January 27, 2025

Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation

Wenzhang Liu, Lianjun Jin, Lu Ren, Chaoxu Mu, Changyin Sun

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causal effect estimation method to identify and suppress redundant actions in deep reinforcement learning, improving exploration efficiency in large action spaces.

Contribution

It proposes a novel approach combining inverse dynamics modeling and causal effect estimation to quantitatively reduce action redundancy during exploration.

Findings

01

Enhanced exploration efficiency in environments with large action spaces

02

Quantitative evidence of action causality improves decision-making

03

Theoretical analysis supports the method's effectiveness

Abstract

Intelligent decision-making within large and redundant action spaces remains challenging in deep reinforcement learning. Considering similar but ineffective actions at each step can lead to repetitive and unproductive trials. Existing methods attempt to improve agent exploration by reducing or penalizing redundant actions, yet they fail to provide quantitative and reliable evidence to determine redundancy. In this paper, we propose a method to improve exploration efficiency by estimating the causal effects of actions. Unlike prior methods, our approach offers quantitative results regarding the causality of actions for one-step transitions. We first pre-train an inverse dynamics model to serve as prior knowledge of the environment. Subsequently, we classify actions across the entire action space at each time step and estimate the causal effect of each action to suppress redundant actions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

agi-brain/cee
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Anomaly Detection Techniques and Applications