Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh

TL;DR
This paper introduces a systematic framework for RL agents to learn and utilize inapplicable actions, reducing sample complexity and enabling knowledge transfer across tasks by masking irrelevant actions.
Contribution
It proposes a standardized method for specifying inapplicable actions and a framework for autonomous learning of action preconditions to improve RL efficiency.
Findings
Learning inapplicable actions enhances sample efficiency.
Knowledge transfer accelerates learning in new tasks.
Autonomous precondition learning outperforms hand-crafted approaches.
Abstract
Reinforcement Learning (RL) algorithms are known to scale poorly to environments with many available actions, requiring numerous samples to learn an optimal policy. The traditional approach of considering the same fixed action space in every possible state implies that the agent must understand, while also learning to maximize its reward, to ignore irrelevant actions such as (i.e. actions that have no effect on the environment when performed in a given state). Knowing this information can help reduce the sample complexity of RL algorithms by masking the inapplicable actions from the policy distribution to only explore actions relevant to finding an optimal policy. While this technique has been formalized for quite some time within the Automated Planning community with the concept of precondition in the STRIPS language, RL algorithms have never formally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · AI-based Problem Solving and Planning · Machine Learning and Algorithms
