Disentangling Logic: The Role of Context in Large Language Model   Reasoning Capabilities

Wenyue Hua; Kaijie Zhu; Lingyao Li; Lizhou Fan; Shuhang Lin; Mingyu; Jin; Haochen Xue; Zelong Li; JinDong Wang; Yongfeng Zhang

arXiv:2406.02787·cs.CL·June 6, 2024·1 cites

Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities

Wenyue Hua, Kaijie Zhu, Lingyao Li, Lizhou Fan, Shuhang Lin, Mingyu, Jin, Haochen Xue, Zelong Li, JinDong Wang, Yongfeng Zhang

PDF

Open Access

TL;DR

This paper investigates whether large language models genuinely reason or rely on context by comparing their performance on abstract versus contextualized logical problems across multiple domains.

Contribution

It introduces a systematic approach to disentangle reasoning from text understanding in LLMs using datasets of propositional logic problems with varying difficulty and context.

Findings

01

LLMs perform differently on abstract versus contextualized problems.

02

Fine-tuning on abstract logic can improve reasoning in contextual scenarios.

03

The study provides datasets and benchmarks for future reasoning evaluation.

Abstract

This study intends to systematically disentangle pure logic reasoning and text understanding by investigating the contrast across abstract and contextualized logical problems from a comprehensive set of domains. We explore whether LLMs demonstrate genuine reasoning capabilities across various domains when the underlying logical structure remains constant. We focus on two main questions (1) Can abstract logical problems alone accurately benchmark an LLM's reasoning ability in real-world scenarios, disentangled from contextual support in practical settings? (2) Does fine-tuning LLMs on abstract logic problem generalize to contextualized logic problems and vice versa? To investigate these questions, we focus on standard propositional logic, specifically propositional deductive and abductive logic reasoning. In particular, we construct instantiated datasets for deductive and abductive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies

MethodsSparse Evolutionary Training · Focus