Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture
Rintaro Ando

TL;DR
This paper introduces the EG-MRSI framework, a theoretical architecture combining metacognition, emotion-driven motivation, and recursive self-improvement, with formal safety and learning metrics for developing safe, open-ended AGI.
Contribution
It presents the first formal theoretical foundation for an emotion-gradient metacognitive recursive self-improvement architecture, integrating safety, intrinsic motivation, and semantic learning metrics.
Findings
Defines the initial agent configuration and emotion-gradient dynamics.
Introduces quantifiable metrics: Meaning Density and Meaning Conversion Efficiency.
Establishes a reinforcement-compatible optimization objective for agent development.
Abstract
We present the Emotion-Gradient Metacognitive Recursive Self-Improvement (EG-MRSI) framework, a novel architecture that integrates introspective metacognition, emotion-based intrinsic motivation, and recursive self-modification into a unified theoretical system. The framework is explicitly capable of overwriting its own learning algorithm under formally bounded risk. Building upon the Noise-to-Meaning RSI (N2M-RSI) foundation, EG-MRSI introduces a differentiable intrinsic reward function driven by confidence, error, novelty, and cumulative success. This signal regulates both a metacognitive mapping and a self-modification operator constrained by provable safety mechanisms. We formally define the initial agent configuration, emotion-gradient dynamics, and RSI trigger conditions, and derive a reinforcement-compatible optimization objective that guides the agent's development trajectory.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Computing and Networks · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning
