TL;DR
This paper demonstrates that integrating predictive coding modules into meta-reinforcement learning enhances the learning of interpretable, Bayes-optimal belief representations, leading to better generalization and active information seeking in partially observable environments.
Contribution
The paper introduces a novel approach combining predictive coding with meta-RL to improve belief representation learning under partial observability, which was not previously achieved.
Findings
Meta-RL with predictive modules produces more interpretable belief states.
Predictive modules enable learning of optimal representations in active information seeking tasks.
Enhanced representations lead to improved generalization across tasks.
Abstract
Learning a compact representation of history is critical for planning and generalization in partially observable environments. While meta-reinforcement learning (RL) agents can attain near Bayes-optimal policies, they often fail to learn the compact, interpretable Bayes-optimal belief states. This representational inefficiency potentially limits the agent's adaptability and generalization capacity. Inspired by predictive coding in neuroscience--which suggests that the brain predicts sensory inputs as a neural implementation of Bayesian inference--and by auxiliary predictive objectives in deep RL, we investigate whether integrating self-supervised predictive coding modules into meta-RL can facilitate learning of Bayes-optimal representations. Through state machine simulation, we show that meta-RL with predictive modules consistently generates more interpretable representations that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
