Neural Predictive Belief Representations
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo A., Pires, R\'emi Munos

TL;DR
This paper explores neural methods for learning belief state representations in partially observable environments, demonstrating that contrastive predictive coding can encode both state and uncertainty, aiding decision making.
Contribution
It shows that neural architectures using CPC can effectively learn belief representations, including uncertainty, in complex environments, advancing unsupervised learning for partially observable domains.
Findings
All methods learned belief representations encoding environment state.
CPC with multi-step prediction and action-conditioning improves accuracy.
Representations include both state information and uncertainty.
Abstract
Unsupervised representation learning has succeeded with excellent results in many applications. It is an especially powerful tool to learn a good representation of environments with partial or noisy observations. In partially observable domains it is important for the representation to encode a belief state, a sufficient statistic of the observations seen so far. In this paper, we investigate whether it is possible to learn such a belief representation using modern neural architectures. Specifically, we focus on one-step frame prediction and two variants of contrastive predictive coding (CPC) as the objective functions to learn the representations. To evaluate these learned representations, we test how well they can predict various pieces of information about the underlying state of the environment, e.g., position of the agent in a 3D maze. We show that all three methods are able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsInfoNCE · Contrastive Predictive Coding
