Marti-5: A Mathematical Model of "Self in the World" as a First Step Toward Self-Awareness
Igor Pivovarov, Sergey Shumsky

TL;DR
This paper introduces a biologically inspired mathematical model of self-awareness, integrating 'what' and 'where' pathways to enable an agent to distinguish itself from the environment and learn purposeful behavior.
Contribution
It presents a novel mathematical model combining neural pathways to simulate self-awareness and demonstrates its effectiveness in reinforcement learning agents playing Atari games.
Findings
Agent successfully learned to play Pong and Breakout
Separating self from environment improves agent performance
Model suggests self-awareness could have evolved in living organisms
Abstract
The existence of 'what' and 'where' pathways of information processing in the brain was proposed almost 30 years ago, but there is still a lack of a clear mathematical model that could show how these pathways work together. We propose a biologically inspired mathematical model that uses this idea to identify and separate the self from the environment and then build and use a self-model for better predictions. This is a model of neocortical columns governed by the basal ganglia to make predictions and choose the next action, where some columns act as 'what' columns and others act as 'where' columns. Based on this model, we present a reinforcement learning agent that learns purposeful behavior in a virtual environment. We evaluate the agent on the Atari games Pong and Breakout, where it successfully learns to play. We conclude that the ability to separate the self from the environment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Science and Education Research · Reinforcement Learning in Robotics · Embodied and Extended Cognition
