Automated mapping of virtual environments with visual predictive coding
James Gornet, Matthew Thomson

TL;DR
This paper demonstrates that predictive coding with neural networks can automatically construct internal spatial maps from visual data, enabling navigation and potentially extending to other sensory modalities.
Contribution
The paper introduces a predictive coding framework in which an agent builds a cognitive map of a virtual environment using a self-attention-equipped convolutional neural network, and argues that the same strategy could generalize beyond visual spatial mapping.
Findings
The agent learns to predict next images while constructing environment representations.
The internal map reflects distances and supports vector navigation.
The authors suggest that predictive coding could extend to auditory, tactile, and linguistic mapping.
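One way to read the claim that the internal map reflects distances is to compare pairwise distances between latent vectors with pairwise distances between the physical positions where those latents were recorded. The sketch below is illustrative only and is not the authors' analysis: the function names, the synthetic positions, and the near-isometric embedding that stands in for learned latents are all assumptions made here.

```python
import numpy as np

def pairwise_distances(points):
    """Euclidean distance between every pair of rows in `points`."""
    diffs = points[:, None, :] - points[None, :, :]
    return np.sqrt((diffs ** 2).sum(axis=-1))

def distance_correlation(latents, positions):
    """Pearson correlation between latent and physical pairwise distances."""
    upper = np.triu_indices(len(latents), k=1)  # count each pair once
    d_latent = pairwise_distances(latents)[upper]
    d_physical = pairwise_distances(positions)[upper]
    return float(np.corrcoef(d_latent, d_physical)[0, 1])

def navigation_vector(latent_from, latent_to):
    """Hypothetical heading in latent space: the difference of two latents."""
    return latent_to - latent_from

rng = np.random.default_rng(0)

# Hypothetical data: 50 agent positions in a 10 x 10 arena.
positions = rng.uniform(0.0, 10.0, size=(50, 2))

# Stand-in for learned latents (an assumption, not model output):
# an isometric embedding of the positions into 8 dimensions, plus small noise.
basis, _ = np.linalg.qr(rng.normal(size=(8, 2)))  # 8 x 2, orthonormal columns
clean_latents = positions @ basis.T
latents = clean_latents + rng.normal(scale=0.05, size=clean_latents.shape)

score = distance_correlation(latents, positions)
heading = navigation_vector(clean_latents[0], clean_latents[1])
```

A correlation close to 1 would indicate that latent distances track physical distances. With real model latents, the strength of that relationship is an empirical question.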
Abstract
Humans construct internal cognitive maps of their environment directly from sensory inputs without access to a system of explicit coordinates or distance measurements. While machine learning algorithms like SLAM utilize specialized visual inference procedures to identify visual features and construct spatial maps from visual and odometry data, the general nature of cognitive maps in the brain suggests a unified mapping algorithmic strategy that can generalize to auditory, tactile, and linguistic inputs. Here, we demonstrate that predictive coding provides a natural and versatile neural network algorithm for constructing spatial maps using sensory data. We introduce a framework in which an agent navigates a virtual environment while engaging in visual predictive coding using a self-attention-equipped convolutional neural network. While learning a next image prediction task, the agent…
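The next image prediction task described in the abstract can be sketched in a few lines. The following is a minimal illustration and not the paper's architecture: it applies generic scaled dot-product self-attention to a short history of frame encodings and scores the prediction with a mean-squared error. The encoding size, the history length, the random weights, and the choice of mean-squared error are all assumptions made here.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)

def self_attention(tokens, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a sequence of token vectors."""
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ v, weights

def next_frame_loss(predicted, actual):
    """Mean-squared prediction error (assumed loss, for illustration)."""
    return float(np.mean((predicted - actual) ** 2))

rng = np.random.default_rng(0)

# Hypothetical sizes: 5 past frames, each already encoded to 16 numbers.
num_frames, dim = 5, 16
history = rng.normal(size=(num_frames, dim))

# Random, untrained projection weights, used only to make the sketch runnable.
w_q, w_k, w_v = (rng.normal(size=(dim, dim)) * 0.1 for _ in range(3))

context, weights = self_attention(history, w_q, w_k, w_v)
predicted_next = context[-1]           # most recent frame's context as the guess
actual_next = rng.normal(size=dim)     # placeholder for the true next encoding
loss = next_frame_loss(predicted_next, actual_next)
```

In a trained system the weights would be updated to reduce this prediction error; the sketch computes the error once and performs no training.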
Taxonomy
Topics: Visual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Tactile and Sensory Interactions
