Reinforcement Learning With Sparse-Executing Actions via Sparsity   Regularization

Jing-Cheng Pang; Tian Xu; Shengyi Jiang; Yu-Ren Liu; Yang Yu

arXiv:2105.08666·cs.LG·July 23, 2024

Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization

Jing-Cheng Pang, Tian Xu, Shengyi Jiang, Yu-Ren Liu, Yang Yu

PDF

Open Access

TL;DR

This paper introduces a new RL framework and algorithm for decision-making tasks where actions can only be executed a limited number of times, addressing a common challenge in real-world applications.

Contribution

The paper formalizes the Sparse Action Markov Decision Process and proposes the ASRE algorithm, which adaptively manages action sparsity during policy learning.

Findings

01

ASRE effectively handles sparse actions in RL tasks.

02

ASRE outperforms classical RL algorithms in sparse-action environments.

03

ASRE improves performance in Atari games with action constraints.

Abstract

Reinforcement learning (RL) has demonstrated impressive performance in decision-making tasks like embodied control, autonomous driving and financial trading. In many decision-making tasks, the agents often encounter the problem of executing actions under limited budgets. However, classic RL methods typically overlook the challenges posed by such sparse-executing actions. They operate under the assumption that all actions can be taken for a unlimited number of times, both in the formulation of the problem and in the development of effective algorithms. To tackle the issue of limited action execution in RL, this paper first formalizes the problem as a Sparse Action Markov Decision Process (SA-MDP), in which specific actions in the action space can only be executed for a limited time. Then, we propose a policy optimization algorithm, Action Sparsity REgularization (ASRE), which adaptively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Advanced Bandit Algorithms Research