Neural Predictive Belief Representations

Zhaohan Daniel Guo; Mohammad Gheshlaghi Azar; Bilal Piot; Bernardo A.; Pires; R\'emi Munos

arXiv:1811.06407·cs.LG·August 20, 2019·47 cites

Neural Predictive Belief Representations

Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo A., Pires, R\'emi Munos

PDF

Open Access

TL;DR

This paper explores neural methods for learning belief state representations in partially observable environments, demonstrating that contrastive predictive coding can encode both state and uncertainty, aiding decision making.

Contribution

It shows that neural architectures using CPC can effectively learn belief representations, including uncertainty, in complex environments, advancing unsupervised learning for partially observable domains.

Findings

01

All methods learned belief representations encoding environment state.

02

CPC with multi-step prediction and action-conditioning improves accuracy.

03

Representations include both state information and uncertainty.

Abstract

Unsupervised representation learning has succeeded with excellent results in many applications. It is an especially powerful tool to learn a good representation of environments with partial or noisy observations. In partially observable domains it is important for the representation to encode a belief state, a sufficient statistic of the observations seen so far. In this paper, we investigate whether it is possible to learn such a belief representation using modern neural architectures. Specifically, we focus on one-step frame prediction and two variants of contrastive predictive coding (CPC) as the objective functions to learn the representations. To evaluate these learned representations, we test how well they can predict various pieces of information about the underlying state of the environment, e.g., position of the agent in a 3D maze. We show that all three methods are able to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)

MethodsInfoNCE · Contrastive Predictive Coding