Achieving Sample and Computational Efficient Reinforcement Learning by   Action Space Reduction via Grouping

Yining Li; Peizhong Ju; Ness Shroff

arXiv:2306.12981·cs.LG·June 23, 2023·1 cites

Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping

Yining Li, Peizhong Ju, Ness Shroff

PDF

Open Access 1 Video

TL;DR

This paper introduces a method to reduce the action space in reinforcement learning by grouping similar actions, balancing performance and efficiency through an optimized grouping strategy.

Contribution

It proposes a novel action grouping approach based on transition and reward similarities, with a theoretical analysis and an efficient method for optimal grouping.

Findings

01

Refined grouping reduces approximation error but increases estimation error with limited samples.

02

Optimal grouping balances performance loss and computational complexity.

03

The proposed method maintains efficiency regardless of action space size.

Abstract

Reinforcement learning often needs to deal with the exponential growth of states and actions when exploring optimal control in high-dimensional spaces (often known as the curse of dimensionality). In this work, we address this issue by learning the inherent structure of action-wise similar MDP to appropriately balance the performance degradation versus sample/computational complexity. In particular, we partition the action spaces into multiple groups based on the similarity in transition distribution and reward function, and build a linear decomposition model to capture the difference between the intra-group transition kernel and the intra-group rewards. Both our theoretical analysis and experiments reveal a \emph{surprising and counter-intuitive result}: while a more refined grouping strategy can reduce the approximation error caused by treating actions in the same group as identical,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics