A neurally plausible model learns successor representations in partially observable environments
Eszter Vertes, Maneesh Sahani

TL;DR
This paper introduces a biologically plausible model that learns successor representations in noisy, partially observable environments, enabling efficient reinforcement learning when direct policy learning is challenging.
Contribution
The paper proposes a novel distributional successor features model that incorporates neural-like uncertainty representation for learning in noisy, partially observable settings.
Findings
Supports reinforcement learning in noisy environments
Enables rapid value computation and adaptation
Handles uncertainty in sensory observations
Abstract
Animals need to devise strategies to maximize returns while interacting with their environment based on incoming noisy sensory observations. Task-relevant states, such as the agent's location within an environment or the presence of a predator, are often not directly observable but must be inferred using available sensory information. Successor representations (SR) have been proposed as a middle-ground between model-based and model-free reinforcement learning strategies, allowing for fast value computation and rapid adaptation to changes in the reward function or goal locations. Indeed, recent studies suggest that features of neural responses are consistent with the SR framework. However, it is not clear how such representations might be learned and computed in partially observed, noisy environments. Here, we introduce a neurally plausible model using distributional successor features,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
A neurally plausible model learns successor representations in partially observable environments· youtube
Taxonomy
TopicsNeural dynamics and brain function · Reinforcement Learning in Robotics · Neural Networks and Applications
