A neurally plausible model learns successor representations in partially   observable environments

Eszter Vertes; Maneesh Sahani

arXiv:1906.09480·stat.ML·June 25, 2019·27 cites

A neurally plausible model learns successor representations in partially observable environments

Eszter Vertes, Maneesh Sahani

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a biologically plausible model that learns successor representations in noisy, partially observable environments, enabling efficient reinforcement learning when direct policy learning is challenging.

Contribution

The paper proposes a novel distributional successor features model that incorporates neural-like uncertainty representation for learning in noisy, partially observable settings.

Findings

01

Supports reinforcement learning in noisy environments

02

Enables rapid value computation and adaptation

03

Handles uncertainty in sensory observations

Abstract

Animals need to devise strategies to maximize returns while interacting with their environment based on incoming noisy sensory observations. Task-relevant states, such as the agent's location within an environment or the presence of a predator, are often not directly observable but must be inferred using available sensory information. Successor representations (SR) have been proposed as a middle-ground between model-based and model-free reinforcement learning strategies, allowing for fast value computation and rapid adaptation to changes in the reward function or goal locations. Indeed, recent studies suggest that features of neural responses are consistent with the SR framework. However, it is not clear how such representations might be learned and computed in partially observed, noisy environments. Here, we introduce a neurally plausible model using distributional successor features,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

evertes/distributional_SF
noneOfficial

Videos

A neurally plausible model learns successor representations in partially observable environments· youtube

Taxonomy

TopicsNeural dynamics and brain function · Reinforcement Learning in Robotics · Neural Networks and Applications