Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning

Xuan Zhang; Zhijian Zhou; Weidi Xu; Yanting Miao; Chao Qu; Yuan Qi

arXiv:2508.16524·cs.AI·August 25, 2025

Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning

Xuan Zhang, Zhijian Zhou, Weidi Xu, Yanting Miao, Chao Qu, Yuan Qi

PDF

TL;DR

This paper introduces a diffusion-based neuro-symbolic reasoning framework that effectively learns logical constraints and improves symbolic reasoning in neural networks through a two-stage training process and policy optimization.

Contribution

It presents a novel diffusion reasoning approach with a two-stage training strategy and a Markov decision process formulation for enforcing logical constraints.

Findings

01

Achieves high accuracy on symbolic reasoning benchmarks.

02

Ensures logical consistency in neural network outputs.

03

Outperforms existing methods in Sudoku and maze tasks.

Abstract

Enabling neural networks to learn complex logical constraints and fulfill symbolic reasoning is a critical challenge. Bridging this gap often requires guiding the neural network's output distribution to move closer to the symbolic constraints. While diffusion models have shown remarkable generative capability across various domains, we employ the powerful architecture to perform neuro-symbolic learning and solve logical puzzles. Our diffusion-based pipeline adopts a two-stage training strategy: the first stage focuses on cultivating basic reasoning abilities, while the second emphasizes systematic learning of logical constraints. To impose hard constraints on neural outputs in the second stage, we formulate the diffusion reasoner as a Markov decision process and innovatively fine-tune it with an improved proximal policy optimization algorithm. We utilize a rule-based reward signal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.