Meta-Cognitive Reinforcement Learning with Self-Doubt and Recovery

Zhipeng Zhang; Xiongfei Su; Kai Li

arXiv:2601.20193·cs.LG·March 24, 2026

Meta-Cognitive Reinforcement Learning with Self-Doubt and Recovery

Zhipeng Zhang, Xiongfei Su, Kai Li

PDF

Open Access

TL;DR

This paper introduces a meta-cognitive reinforcement learning framework that enables agents to assess and regulate their learning process based on internal reliability signals, improving robustness against reward corruption.

Contribution

It proposes a novel meta-trust variable driven by VPES to modulate learning and enable recovery, enhancing robustness in noisy environments.

Findings

01

Higher average returns in reward-corrupted benchmarks

02

Significant reduction in late-stage training failures

03

Effective self-regulation of learning process

Abstract

Robust reinforcement learning methods typically focus on suppressing unreliable experiences or corrupted rewards, but they lack the ability to reason about the reliability of their own learning process. As a result, such methods often either overreact to noise by becoming overly conservative or fail catastrophically when uncertainty accumulates. In this work, we propose a meta-cognitive reinforcement learning framework that enables an agent to assess, regulate, and recover its learning behavior based on internally estimated reliability signals. The proposed method introduces a meta-trust variable driven by Value Prediction Error Stability (VPES), which modulates learning dynamics via fail-safe regulation and gradual trust recovery. Experiments on continuous-control benchmarks with reward corruption demonstrate that recovery-enabled meta-cognitive control achieves higher average…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)