Inapplicable Actions Learning for Knowledge Transfer in Reinforcement   Learning

Leo Ardon; Alberto Pozanco; Daniel Borrajo; Sumitra Ganesh

arXiv:2211.15589·cs.LG·May 15, 2023

Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh

PDF

Open Access

TL;DR

This paper introduces a systematic framework for RL agents to learn and utilize inapplicable actions, reducing sample complexity and enabling knowledge transfer across tasks by masking irrelevant actions.

Contribution

It proposes a standardized method for specifying inapplicable actions and a framework for autonomous learning of action preconditions to improve RL efficiency.

Findings

01

Learning inapplicable actions enhances sample efficiency.

02

Knowledge transfer accelerates learning in new tasks.

03

Autonomous precondition learning outperforms hand-crafted approaches.

Abstract

Reinforcement Learning (RL) algorithms are known to scale poorly to environments with many available actions, requiring numerous samples to learn an optimal policy. The traditional approach of considering the same fixed action space in every possible state implies that the agent must understand, while also learning to maximize its reward, to ignore irrelevant actions such as $inapplicable actions$ (i.e. actions that have no effect on the environment when performed in a given state). Knowing this information can help reduce the sample complexity of RL algorithms by masking the inapplicable actions from the policy distribution to only explore actions relevant to finding an optimal policy. While this technique has been formalized for quite some time within the Automated Planning community with the concept of precondition in the STRIPS language, RL algorithms have never formally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · AI-based Problem Solving and Planning · Machine Learning and Algorithms