Constructive Symbolic Reinforcement Learning via Intuitionistic Logic and Goal-Chaining Inference
Andrei T. Patrascu

TL;DR
This paper presents a new reinforcement learning framework that uses constructive logical inference and goal-chaining to ensure safe, interpretable, and efficient decision-making without probabilistic trial-and-error.
Contribution
It introduces a symbolic, logic-based reinforcement learning approach that guarantees valid actions through proof construction, contrasting with traditional probabilistic methods.
Findings
Achieves perfect safety with no invalid actions.
Demonstrates efficient convergence compared to Q-learning.
Provides interpretable and verifiable plans in gridworld environments.
Abstract
We introduce a novel learning and planning framework that replaces traditional reward-based optimisation with constructive logical inference. In our model, actions, transitions, and goals are represented as logical propositions, and decision-making proceeds by building constructive proofs under intuitionistic logic. This method ensures that state transitions and policies are accepted only when supported by verifiable preconditions -- eschewing probabilistic trial-and-error in favour of guaranteed logical validity. We implement a symbolic agent operating in a structured gridworld, where reaching a goal requires satisfying a chain of intermediate subgoals (e.g., collecting keys to open doors), each governed by logical constraints. Unlike conventional reinforcement learning agents, which require extensive exploration and suffer from unsafe or invalid transitions, our constructive agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, Reasoning, and Knowledge · AI-based Problem Solving and Planning · Constraint Satisfaction and Optimization
