Neural Reward Machines

Elena Umili; Francesco Argenziano; Roberto Capobianco

arXiv:2408.08677·cs.LG·August 19, 2024

Neural Reward Machines

Elena Umili, Francesco Argenziano, Roberto Capobianco

PDF

Open Access 1 Repo

TL;DR

Neural Reward Machines (NRMs) are a neurosymbolic framework that enables reasoning and learning in non-markovian RL tasks by combining automata-based models with semi-supervised symbol grounding, outperforming traditional Deep RL methods.

Contribution

This paper introduces NRMs, a novel automata-based neurosymbolic approach for non-markovian RL that does not require prior knowledge of the symbol grounding function.

Findings

01

NRMs outperform Deep RL methods in non-symbolic environments.

02

NRMs can exploit high-level symbolic knowledge without explicit symbol grounding.

03

Proposed SSSG algorithm is 1000 times more efficient in analyzing temporal specifications.

Abstract

Non-markovian Reinforcement Learning (RL) tasks are very hard to solve, because agents must consider the entire history of state-action pairs to act rationally in the environment. Most works use symbolic formalisms (as Linear Temporal Logic or automata) to specify the temporally-extended task. These approaches only work in finite and discrete state environments or continuous problems for which a mapping between the raw state and a symbolic interpretation is known as a symbol grounding (SG) function. Here, we define Neural Reward Machines (NRM), an automata-based neurosymbolic framework that can be used for both reasoning and learning in non-symbolic non-markovian RL domains, which is based on the probabilistic relaxation of Moore Machines. We combine RL with semisupervised symbol grounding (SSSG) and we show that NRMs can exploit high-level symbolic knowledge in non-symbolic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

krlgroup/neuralrewardmachines
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · EEG and Brain-Computer Interfaces