Neural Reward Machines
Elena Umili, Francesco Argenziano, Roberto Capobianco

TL;DR
Neural Reward Machines (NRMs) are a neurosymbolic framework that enables reasoning and learning in non-markovian RL tasks by combining automata-based models with semi-supervised symbol grounding, outperforming traditional Deep RL methods.
Contribution
This paper introduces NRMs, a novel automata-based neurosymbolic approach for non-markovian RL that does not require prior knowledge of the symbol grounding function.
Findings
NRMs outperform Deep RL methods in non-symbolic environments.
NRMs can exploit high-level symbolic knowledge without explicit symbol grounding.
Proposed SSSG algorithm is 1000 times more efficient in analyzing temporal specifications.
Abstract
Non-markovian Reinforcement Learning (RL) tasks are very hard to solve, because agents must consider the entire history of state-action pairs to act rationally in the environment. Most works use symbolic formalisms (as Linear Temporal Logic or automata) to specify the temporally-extended task. These approaches only work in finite and discrete state environments or continuous problems for which a mapping between the raw state and a symbolic interpretation is known as a symbol grounding (SG) function. Here, we define Neural Reward Machines (NRM), an automata-based neurosymbolic framework that can be used for both reasoning and learning in non-symbolic non-markovian RL domains, which is based on the probabilistic relaxation of Moore Machines. We combine RL with semisupervised symbol grounding (SSSG) and we show that NRMs can exploit high-level symbolic knowledge in non-symbolic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · EEG and Brain-Computer Interfaces
