Language Models can be Logical Solvers
Jiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen,, Dongyan Zhao, Weizhu Chen

TL;DR
This paper introduces LoGiPT, a language model that directly emulates logical solvers' reasoning, reducing parsing errors and outperforming existing solver-augmented models on deductive reasoning tasks.
Contribution
LoGiPT is a novel model trained to emulate logical solvers directly, bypassing parsing errors and improving reasoning accuracy over prior approaches.
Findings
LoGiPT outperforms state-of-the-art solver-augmented LMs.
LoGiPT surpasses few-shot prompting methods on deductive reasoning datasets.
The model demonstrates improved robustness by avoiding parsing errors.
Abstract
Logical reasoning is a fundamental aspect of human intelligence and a key component of tasks like problem-solving and decision-making. Recent advancements have enabled Large Language Models (LLMs) to potentially exhibit reasoning capabilities, but complex logical reasoning remains a challenge. The state-of-the-art, solver-augmented language models, use LLMs to parse natural language logical questions into symbolic representations first and then adopt external logical solvers to take in the symbolic representations and output the answers. Despite their impressive performance, any parsing errors will inevitably result in the failure of the execution of the external logical solver and no answer to the logical questions. In this paper, we introduce LoGiPT, a novel language model that directly emulates the reasoning processes of logical solvers and bypasses the parsing errors by learning to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗jzfeng/LoGiPT-vicuna-13b-v1.5-16k-proofwritermodel· 5 dl· ♡ 25 dl♡ 2
- 🤗jzfeng/LoGiPT-CodeLlama-13b-hf-proofwritermodel· 1 dl· ♡ 11 dl♡ 1
- 🤗jzfeng/LoGiPT-CodeLlama-13b-Instruct-hf-proofwritermodel· 6 dl· ♡ 16 dl♡ 1
- 🤗jzfeng/LoGiPT-vicuna-13b-v1.5-16k-prontoqamodel· 2 dl· ♡ 22 dl♡ 2
- 🤗jzfeng/LoGiPT-CodeLlama-13b-hf-prontoqamodel· 1 dl· ♡ 11 dl♡ 1
- 🤗jzfeng/LoGiPT-CodeLlama-13b-Instruct-hf-prontoqamodel· 1 dl· ♡ 11 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
MethodsMulti-Head Attention · Attention Is All You Need · Residual Connection · Byte Pair Encoding · Dropout · Softmax · Linear Layer · Adam · Label Smoothing · Absolute Position Encodings
