Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Tennison Liu, Mihaela van der Schaar

TL;DR
This paper argues that for self-improving agents to truly advance, they need intrinsic metacognitive learning capabilities inspired by human cognition, involving self-assessment, planning, and reflection to enhance adaptability and scalability.
Contribution
The paper introduces a formal framework for intrinsic metacognitive learning in agents, analyzing existing methods and proposing directions for scalable, adaptive self-improvement.
Findings
Existing agents rely on fixed extrinsic metacognitive mechanisms.
Intrinsic metacognitive components are largely present but underutilized.
Addressing key challenges can enable sustained, generalized self-improvement.
Abstract
Self-improving agents aim to continuously acquire new capabilities with minimal supervision. However, current approaches face two key limitations: their self-improvement processes are often rigid, fail to generalize across tasks domains, and struggle to scale with increasing agent capabilities. We argue that effective self-improvement requires intrinsic metacognitive learning, defined as an agent's intrinsic ability to actively evaluate, reflect on, and adapt its own learning processes. Drawing inspiration from human metacognition, we introduce a formal framework comprising three components: metacognitive knowledge (self-assessment of capabilities, tasks, and learning strategies), metacognitive planning (deciding what and how to learn), and metacognitive evaluation (reflecting on learning experiences to improve future learning). Analyzing existing self-improving agents, we find they…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbodied and Extended Cognition · Social Robot Interaction and HRI · Reinforcement Learning in Robotics
