Is your LLM trapped in a Mental Set? Investigative study on how mental   sets affect the reasoning capabilities of LLMs

Saiful Haq; Niyati Chhaya; Piyush Pandey; Pushpak Bhattacharya

arXiv:2501.11833·cs.CL·January 22, 2025

Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs

Saiful Haq, Niyati Chhaya, Piyush Pandey, Pushpak Bhattacharya

PDF

Open Access

TL;DR

This study investigates how mental sets, a concept from psychology, influence the reasoning abilities of large language models, revealing insights into their adaptability and problem-solving limitations.

Contribution

It introduces a novel evaluation framework incorporating cognitive psychology to assess LLMs' adaptability to mental sets in complex reasoning tasks.

Findings

01

LLMs exhibit difficulty overcoming mental sets in reasoning tasks.

02

Current benchmarks may not fully capture models' adaptability to unfamiliar situations.

03

Incorporating psychological concepts enhances understanding of LLM reasoning capabilities.

Abstract

In this paper, we present an investigative study on how Mental Sets influence the reasoning capabilities of LLMs. LLMs have excelled in diverse natural language processing (NLP) tasks, driven by advancements in parameter-efficient fine-tuning (PEFT) and emergent capabilities like in-context learning (ICL). For complex reasoning tasks, selecting the right model for PEFT or ICL is critical, often relying on scores on benchmarks such as MMLU, MATH, and GSM8K. However, current evaluation methods, based on metrics like F1 Score or reasoning chain assessments by larger models, overlook a key dimension: adaptability to unfamiliar situations and overcoming entrenched thinking patterns. In cognitive psychology, Mental Set refers to the tendency to persist with previously successful strategies, even when they become inefficient - a challenge for problem solving and reasoning. We compare the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Big Data and Business Intelligence · ERP Systems Implementation and Impact

MethodsSparse Evolutionary Training