Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens
Ziyang Ma, Qingyue Yuan, Zhenglin Wang, Deyu Zhou

TL;DR
This paper introduces AutoMeco, a framework for benchmarking LLMs' meta-cognitive abilities, and proposes MIRA, a strategy to enhance these evaluations without additional training, demonstrating improved assessment on reasoning datasets.
Contribution
The paper presents AutoMeco for systematic meta-cognition evaluation and introduces MIRA to improve lens effectiveness without training, advancing LLM self-assessment methods.
Findings
AutoMeco effectively benchmarks meta-cognition in LLMs.
MIRA enhances the accuracy of meta-cognition evaluation.
Meta-cognition abilities are better assessed with MIRA on reasoning tasks.
Abstract
Previous research has primarily focused on the cognitive error detection capabilities of Large Language Models (LLMs), often prompting them to analyze mistakes in reasoning chains. However, few studies have examined the meta-cognitive abilities of LLMs (e.g., their self-awareness of step errors), which are crucial for their reliability. While studies on LLM self-evaluation present some measures, such as perplexity, which can reflect the answer correctness and be viewed as the lens of meta-cognition, they lack step-level analysis and adaptation. This paper studies the evaluation of LLM meta-cognition using the current lenses and how to improve these lenses. Specifically, we propose AutoMeco, an Automated Meta-cognition Evaluation framework for benchmarking the existing lenses. Furthermore, a training-free Markovian Intrinsic Reward Adjustment strategy, MIRA, is proposed to boost current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education
