Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens

Ziyang Ma; Qingyue Yuan; Zhenglin Wang; Deyu Zhou

arXiv:2506.08410·cs.CL·October 31, 2025

Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens

Ziyang Ma, Qingyue Yuan, Zhenglin Wang, Deyu Zhou

PDF

Open Access 1 Video

TL;DR

This paper introduces AutoMeco, a framework for benchmarking LLMs' meta-cognitive abilities, and proposes MIRA, a strategy to enhance these evaluations without additional training, demonstrating improved assessment on reasoning datasets.

Contribution

The paper presents AutoMeco for systematic meta-cognition evaluation and introduces MIRA to improve lens effectiveness without training, advancing LLM self-assessment methods.

Findings

01

AutoMeco effectively benchmarks meta-cognition in LLMs.

02

MIRA enhances the accuracy of meta-cognition evaluation.

03

Meta-cognition abilities are better assessed with MIRA on reasoning tasks.

Abstract

Previous research has primarily focused on the cognitive error detection capabilities of Large Language Models (LLMs), often prompting them to analyze mistakes in reasoning chains. However, few studies have examined the meta-cognitive abilities of LLMs (e.g., their self-awareness of step errors), which are crucial for their reliability. While studies on LLM self-evaluation present some measures, such as perplexity, which can reflect the answer correctness and be viewed as the lens of meta-cognition, they lack step-level analysis and adaptation. This paper studies the evaluation of LLM meta-cognition using the current lenses and how to improve these lenses. Specifically, we propose AutoMeco, an Automated Meta-cognition Evaluation framework for benchmarking the existing lenses. Furthermore, a training-free Markovian Intrinsic Reward Adjustment strategy, MIRA, is proposed to boost current…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens· underline

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education