Interpretable end-to-end Neurosymbolic Reinforcement Learning agents
Nils Grandien, Quentin Delfosse, Kristian Kersting

TL;DR
This paper introduces an end-to-end neurosymbolic reinforcement learning agent called SCoBot that combines deep learning and symbolic reasoning to improve interpretability and generalization in Atari games.
Contribution
It presents the first end-to-end trained SCoBot architecture that extracts object-centric representations from raw pixels for interpretable RL, bridging neural and symbolic AI.
Findings
Demonstrates the feasibility of end-to-end training of SCoBots on Atari games.
Shows that object-centric representations improve interpretability of RL agents.
Provides a framework for evaluating components of neurosymbolic RL systems.
Abstract
Deep reinforcement learning (RL) agents rely on shortcut learning, preventing them from generalizing to slightly different environments. To address this problem, symbolic method, that use object-centric states, have been developed. However, comparing these methods to deep agents is not fair, as these last operate from raw pixel-based states. In this work, we instantiate the symbolic SCoBots framework. SCoBots decompose RL tasks into intermediate, interpretable representations, culminating in action decisions based on a comprehensible set of object-centric relational concepts. This architecture aids in demystifying agent decisions. By explicitly learning to extract object-centric representations from raw states, object-centric RL, and policy distillation via rule extraction, this work places itself within the neurosymbolic AI paradigm, blending the strengths of neural networks with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and Computational Modeling · Neural Networks and Applications · Data Stream Mining Techniques
MethodsSparse Evolutionary Training
