On the Reasoning Capacity of AI Models and How to Quantify It
Santosh Kumar Radha, Oktay Goktas

TL;DR
This paper introduces a phenomenological framework to analyze AI reasoning, revealing that current models often rely on memorization and pattern matching rather than genuine reasoning, and proposes new metrics for evaluation beyond accuracy.
Contribution
It develops a novel phenomenological approach with models and metrics to better understand and quantify the reasoning mechanisms of AI systems.
Findings
Models often rely on memorization and pattern matching rather than true reasoning.
Accuracy metrics can overstate a model's reasoning capabilities.
The framework enables strategy-based reliability assessment for AI deployment.
Abstract
Recent advances in Large Language Models (LLMs) have intensified the debate surrounding the fundamental nature of their reasoning capabilities. While achieving high performance on benchmarks such as GPQA and MMLU, these models exhibit limitations in more complex reasoning tasks, highlighting the need for more rigorous evaluation methodologies. We propose a novel phenomenological approach that goes beyond traditional accuracy metrics to probe the underlying mechanisms of model behavior, establishing a framework that could broadly impact how we analyze and understand AI systems. Using positional bias in multiple-choice reasoning tasks as a case study, we demonstrate how systematic perturbations can reveal fundamental aspects of model decision-making. To analyze these behaviors, we develop two complementary phenomenological models: a Probabilistic Mixture Model (PMM) that decomposes model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Semantic Web and Ontologies · Explainable Artificial Intelligence (XAI)
