Interpretable end-to-end Neurosymbolic Reinforcement Learning agents

Nils Grandien; Quentin Delfosse; Kristian Kersting

arXiv:2410.14371·cs.AI·November 4, 2025

Interpretable end-to-end Neurosymbolic Reinforcement Learning agents

Nils Grandien, Quentin Delfosse, Kristian Kersting

PDF

Open Access

TL;DR

This paper introduces an end-to-end neurosymbolic reinforcement learning agent called SCoBot that combines deep learning and symbolic reasoning to improve interpretability and generalization in Atari games.

Contribution

It presents the first end-to-end trained SCoBot architecture that extracts object-centric representations from raw pixels for interpretable RL, bridging neural and symbolic AI.

Findings

01

Demonstrates the feasibility of end-to-end training of SCoBots on Atari games.

02

Shows that object-centric representations improve interpretability of RL agents.

03

Provides a framework for evaluating components of neurosymbolic RL systems.

Abstract

Deep reinforcement learning (RL) agents rely on shortcut learning, preventing them from generalizing to slightly different environments. To address this problem, symbolic method, that use object-centric states, have been developed. However, comparing these methods to deep agents is not fair, as these last operate from raw pixel-based states. In this work, we instantiate the symbolic SCoBots framework. SCoBots decompose RL tasks into intermediate, interpretable representations, culminating in action decisions based on a comprehensible set of object-centric relational concepts. This architecture aids in demystifying agent decisions. By explicitly learning to extract object-centric representations from raw states, object-centric RL, and policy distillation via rule extraction, this work places itself within the neurosymbolic AI paradigm, blending the strengths of neural networks with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical and Computational Modeling · Neural Networks and Applications · Data Stream Mining Techniques

MethodsSparse Evolutionary Training