Meta-R1: Empowering Large Reasoning Models with Metacognition
Haonan Dong, Haoran Ye, Wenhao Zhu, Kehan Jiang, Guojie Song

TL;DR
Meta-R1 introduces a meta-cognitive framework to large reasoning models, significantly improving their performance, efficiency, and adaptability by enabling explicit thinking about their reasoning process.
Contribution
It is the first systematic framework that incorporates explicit metacognitive capabilities into large reasoning models, inspired by cognitive science principles.
Findings
Surpasses state-of-the-art by up to 27.3% on benchmarks.
Reduces token consumption by 15.7% to 32.7%.
Maintains robust performance across datasets and models.
Abstract
Large Reasoning Models (LRMs) demonstrate remarkable capabilities on complex tasks, exhibiting emergent, human-like thinking patterns. Despite their advances, we identify a fundamental limitation: current LRMs lack a dedicated meta-level cognitive system-an essential faculty in human cognition that enables "thinking about thinking". This absence leaves their emergent abilities uncontrollable (non-adaptive reasoning), unreliable (intermediate error), and inflexible (lack of a clear methodology). To address this gap, we introduce Meta-R1, a systematic and generic framework that endows LRMs with explicit metacognitive capabilities. Drawing on principles from cognitive science, Meta-R1 decomposes the reasoning process into distinct object-level and meta-level components, orchestrating proactive planning, online regulation, and adaptive early stopping within a cascaded framework. Experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
