Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping
Yining Li, Peizhong Ju, Ness Shroff

TL;DR
This paper introduces a method to reduce the action space in reinforcement learning by grouping similar actions, balancing performance and efficiency through an optimized grouping strategy.
Contribution
It proposes a novel action grouping approach based on transition and reward similarities, with a theoretical analysis and an efficient method for optimal grouping.
Findings
Refined grouping reduces approximation error but increases estimation error with limited samples.
Optimal grouping balances performance loss and computational complexity.
The proposed method maintains efficiency regardless of action space size.
Abstract
Reinforcement learning often needs to deal with the exponential growth of states and actions when exploring optimal control in high-dimensional spaces (often known as the curse of dimensionality). In this work, we address this issue by learning the inherent structure of action-wise similar MDP to appropriately balance the performance degradation versus sample/computational complexity. In particular, we partition the action spaces into multiple groups based on the similarity in transition distribution and reward function, and build a linear decomposition model to capture the difference between the intra-group transition kernel and the intra-group rewards. Both our theoretical analysis and experiments reveal a \emph{surprising and counter-intuitive result}: while a more refined grouping strategy can reduce the approximation error caused by treating actions in the same group as identical,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics
