Leveraging class abstraction for commonsense reinforcement learning via residual policy gradient methods
Niklas H\"opner, Ilaria Tiddi, Herke van Hoof

TL;DR
This paper introduces a residual policy gradient method that leverages class hierarchies from knowledge graphs to improve reinforcement learning in knowledge-rich environments, enhancing sample efficiency and generalization.
Contribution
It presents a novel approach to incorporate class abstraction from knowledge graphs into RL via residual policy gradients, addressing challenges in leveraging unstructured knowledge.
Findings
Improved sample efficiency in commonsense games.
Enhanced generalization to unseen objects.
Identified limitations due to noisy class knowledge.
Abstract
Enabling reinforcement learning (RL) agents to leverage a knowledge base while learning from experience promises to advance RL in knowledge intensive domains. However, it has proven difficult to leverage knowledge that is not manually tailored to the environment. We propose to use the subclass relationships present in open-source knowledge graphs to abstract away from specific objects. We develop a residual policy gradient method that is able to integrate knowledge across different abstraction levels in the class hierarchy. Our method results in improved sample efficiency and generalisation to unseen objects in commonsense games, but we also investigate failure modes, such as excessive noise in the extracted class knowledge or environments with little class structure.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Hate Speech and Cyberbullying Detection
MethodsBalanced Selection
