A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters
Long Hei Matthew Lam, Ramya Keerthy Thatikonda, Ehsan Shareghi

TL;DR
This paper investigates how the choice of symbolic solver affects the performance of LLM-based logical reasoning, revealing that translation quality significantly impacts reasoning accuracy across different solvers.
Contribution
It provides a systematic comparison of symbolic solvers in LLM reasoning, highlighting the impact of translation quality and solver choice on performance.
Findings
Performance varies by nearly 50% depending on the symbolic solver used.
Translation accuracy strongly correlates with reasoning correctness, especially with Prover9.
Different symbolic solvers present varying translation challenges affecting overall results.
Abstract
The emergence of Large Language Models (LLMs) has demonstrated promising progress in solving logical reasoning tasks effectively. Several recent approaches have proposed to change the role of the LLM from the reasoner into a translator between natural language statements and symbolic representations which are then sent to external symbolic solvers to resolve. This paradigm has established the current state-of-the-art result in logical reasoning (i.e., deductive reasoning). However, it remains unclear whether the variance in performance of these approaches stems from the methodologies employed or the specific symbolic solvers utilized. There is a lack of consistent comparison between symbolic solvers and how they influence the overall reported performance. This is important, as each symbolic solver also has its own input symbolic language, presenting varying degrees of challenge in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Law, AI, and Intellectual Property
