Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision

Xingwei Tan; Marco Valentino; Mahmud Akhter; Maria Liakata; Nikolaos Aletras

arXiv:2505.20415 · cs.CL · September 19, 2025

TL;DR

This paper introduces a novel method combining symbolic reasoning trajectories with Monte Carlo estimation and preference optimization to enhance logical reasoning and generalization in large language models.

Contribution

It proposes a scalable approach to synthesize symbolic reasoning trajectories and train models to improve reasoning reliability and out-of-domain generalization.

Findings

1. Significant performance improvements on FOLIO and LogicAsker benchmarks.

2. Enhanced out-of-domain generalization in claim verification tasks.

3. Effective leveraging of symbolic trajectories via Monte Carlo estimation.

Abstract

Large language models (LLMs) have shown strong performance on many reasoning benchmarks. However, recent studies have pointed to memorization, rather than generalization, as one of the leading causes of such performance. LLMs, in fact, are susceptible to content variations, demonstrating a lack of robust planning or symbolic abstractions supporting their reasoning process. To improve reliability, many attempts have been made to combine LLMs with symbolic methods. Nevertheless, existing approaches fail to effectively leverage symbolic representations due to the challenges involved in developing reliable and scalable verification mechanisms. In this paper, we propose to overcome such limitations by synthesizing high-quality symbolic reasoning trajectories with stepwise pseudo-labels at scale via Monte Carlo estimation. A Process Reward Model (PRM) can be efficiently trained based on the…
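As a rough illustration of what "stepwise pseudo-labels via Monte Carlo estimation" can look like, the sketch below scores each prefix of a reasoning trajectory by the fraction of sampled completions that reach the gold answer. It is a generic sketch of Monte Carlo step labeling, not the authors' implementation: the abstract shown here is truncated and does not specify the procedure. The names mc_step_scores, hard_labels, and toy_rollout, the threshold rule, and the toy first-order-logic steps are all assumptions made for illustration, and toy_rollout stands in for sampling completions from a language model.

```python
import random
from typing import Callable, List, Sequence


def mc_step_scores(
    steps: Sequence[str],
    rollout: Callable[[Sequence[str], random.Random], str],
    gold_answer: str,
    n_rollouts: int = 8,
    seed: int = 0,
) -> List[float]:
    """Score each prefix steps[:k] by the fraction of rollouts that reach gold_answer."""
    rng = random.Random(seed)
    scores = []
    for k in range(1, len(steps) + 1):
        prefix = steps[:k]
        hits = sum(rollout(prefix, rng) == gold_answer for _ in range(n_rollouts))
        scores.append(hits / n_rollouts)
    return scores


def hard_labels(scores: Sequence[float], threshold: float = 0.0) -> List[int]:
    """Assumed labeling rule: a step is positive if its score exceeds the threshold."""
    return [1 if s > threshold else 0 for s in scores]


def toy_rollout(prefix: Sequence[str], rng: random.Random) -> str:
    """Hypothetical stand-in for an LLM completing the trajectory from a prefix."""
    # Once a flawed step is in the prefix, the toy policy never recovers.
    if any("INVALID" in step for step in prefix):
        return "False"
    # From a clean prefix, the toy policy reaches the gold answer most of the time.
    return "True" if rng.random() < 0.75 else "False"


steps = [
    "forall x. (Cat(x) -> Mammal(x))",
    "Cat(tom), so Mammal(tom) by modus ponens",
    "INVALID: Mammal(tom), so Cat(felix)",
]
scores = mc_step_scores(steps, toy_rollout, gold_answer="True")
labels = hard_labels(scores)
```

In this toy setup the third step always receives a score of 0.0, because no rollout from a prefix containing it reaches the gold answer. Step-level scores or labels of this kind are the sort of signal a process reward model could be trained on, under the assumptions stated above.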

Taxonomy

Topics: Topic Modeling · Explainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference