Leveraging class abstraction for commonsense reinforcement learning via   residual policy gradient methods

Niklas H\"opner; Ilaria Tiddi; Herke van Hoof

arXiv:2201.12126·cs.AI·May 3, 2022·1 cites

Leveraging class abstraction for commonsense reinforcement learning via residual policy gradient methods

Niklas H\"opner, Ilaria Tiddi, Herke van Hoof

PDF

Open Access 1 Repo

TL;DR

This paper introduces a residual policy gradient method that leverages class hierarchies from knowledge graphs to improve reinforcement learning in knowledge-rich environments, enhancing sample efficiency and generalization.

Contribution

It presents a novel approach to incorporate class abstraction from knowledge graphs into RL via residual policy gradients, addressing challenges in leveraging unstructured knowledge.

Findings

01

Improved sample efficiency in commonsense games.

02

Enhanced generalization to unseen objects.

03

Identified limitations due to noisy class knowledge.

Abstract

Enabling reinforcement learning (RL) agents to leverage a knowledge base while learning from experience promises to advance RL in knowledge intensive domains. However, it has proven difficult to leverage knowledge that is not manually tailored to the environment. We propose to use the subclass relationships present in open-source knowledge graphs to abstract away from specific objects. We develop a residual policy gradient method that is able to integrate knowledge across different abstraction levels in the class hierarchy. Our method results in improved sample efficiency and generalisation to unseen objects in commonsense games, but we also investigate failure modes, such as excessive noise in the extracted class knowledge or environments with little class structure.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NikeHop/CSRL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Hate Speech and Cyberbullying Detection

MethodsBalanced Selection