Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities
Wenyue Hua, Kaijie Zhu, Lingyao Li, Lizhou Fan, Shuhang Lin, Mingyu, Jin, Haochen Xue, Zelong Li, JinDong Wang, Yongfeng Zhang

TL;DR
This paper investigates whether large language models genuinely reason or rely on context by comparing their performance on abstract versus contextualized logical problems across multiple domains.
Contribution
It introduces a systematic approach to disentangle reasoning from text understanding in LLMs using datasets of propositional logic problems with varying difficulty and context.
Findings
LLMs perform differently on abstract versus contextualized problems.
Fine-tuning on abstract logic can improve reasoning in contextual scenarios.
The study provides datasets and benchmarks for future reasoning evaluation.
Abstract
This study intends to systematically disentangle pure logic reasoning and text understanding by investigating the contrast across abstract and contextualized logical problems from a comprehensive set of domains. We explore whether LLMs demonstrate genuine reasoning capabilities across various domains when the underlying logical structure remains constant. We focus on two main questions (1) Can abstract logical problems alone accurately benchmark an LLM's reasoning ability in real-world scenarios, disentangled from contextual support in practical settings? (2) Does fine-tuning LLMs on abstract logic problem generalize to contextualized logic problems and vice versa? To investigate these questions, we focus on standard propositional logic, specifically propositional deductive and abductive logic reasoning. In particular, we construct instantiated datasets for deductive and abductive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies
MethodsSparse Evolutionary Training · Focus
