On the Reasoning Capacity of AI Models and How to Quantify It

Santosh Kumar Radha; Oktay Goktas

arXiv:2501.13833·cs.AI·January 24, 2025·2 cites

On the Reasoning Capacity of AI Models and How to Quantify It

Santosh Kumar Radha, Oktay Goktas

PDF

Open Access

TL;DR

This paper introduces a phenomenological framework to analyze AI reasoning, revealing that current models often rely on memorization and pattern matching rather than genuine reasoning, and proposes new metrics for evaluation beyond accuracy.

Contribution

It develops a novel phenomenological approach with models and metrics to better understand and quantify the reasoning mechanisms of AI systems.

Findings

01

Models often rely on memorization and pattern matching rather than true reasoning.

02

Accuracy metrics can overstate a model's reasoning capabilities.

03

The framework enables strategy-based reliability assessment for AI deployment.

Abstract

Recent advances in Large Language Models (LLMs) have intensified the debate surrounding the fundamental nature of their reasoning capabilities. While achieving high performance on benchmarks such as GPQA and MMLU, these models exhibit limitations in more complex reasoning tasks, highlighting the need for more rigorous evaluation methodologies. We propose a novel phenomenological approach that goes beyond traditional accuracy metrics to probe the underlying mechanisms of model behavior, establishing a framework that could broadly impact how we analyze and understand AI systems. Using positional bias in multiple-choice reasoning tasks as a case study, we demonstrate how systematic perturbations can reveal fundamental aspects of model decision-making. To analyze these behaviors, we develop two complementary phenomenological models: a Probabilistic Mixture Model (PMM) that decomposes model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Semantic Web and Ontologies · Explainable Artificial Intelligence (XAI)