Three Pathways to Neurosymbolic Reinforcement Learning with Interpretable Model and Policy Networks
Peter Graf, Patrick Emami

TL;DR
This paper explores three methods for creating interpretable, differentiable models in neurosymbolic reinforcement learning, balancing logic, simulation, and learning to enhance interpretability and learnability.
Contribution
It introduces three pathways for implementing neurosymbolic models that combine interpretability with differentiability in reinforcement learning settings.
Findings
Differentiability aids learning but reduces interpretability.
Mapping simulation data to logical predicates is complex.
Trade-offs exist between learnability and interpretability.
Abstract
Neurosymbolic AI combines the interpretability, parsimony, and explicit reasoning of classical symbolic approaches with the statistical learning of data-driven neural approaches. Models and policies that are simultaneously differentiable and interpretable may be key enablers of this marriage. This paper demonstrates three pathways to implementing such models and policies in a real-world reinforcement learning setting. Specifically, we study a broad class of neural networks that build interpretable semantics directly into their architecture. We reveal and highlight both the potential and the essential difficulties of combining logic, simulation, and learning. One lesson is that learning benefits from continuity and differentiability, but classical logic is discrete and non-differentiable. The relaxation to real-valued, differentiable representations presents a trade-off; the more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
