Learning States Representations in POMDP

Gabriella Contardo; Ludovic Denoyer; Thierry Artieres and; Patrick Gallinari

arXiv:1312.6042·cs.LG·June 18, 2014·ICLR·2 cites

Learning States Representations in POMDP

Gabriella Contardo, Ludovic Denoyer, Thierry Artieres and, Patrick Gallinari

PDF

Open Access

TL;DR

This paper introduces a method for learning latent state representations in partially observable Markov decision processes to improve policy learning from limited observations.

Contribution

It presents a novel approach to encode partial observations into a latent space for better policy optimization in POMDPs.

Findings

01

Latent representations improve policy accuracy in POMDPs.

02

The method outperforms existing approaches on benchmark tasks.

03

Efficient learning of states from partial data is demonstrated.

Abstract

We propose to deal with sequential processes where only partial observations are available by learning a latent representation space on which policies may be accurately learned.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Data Stream Mining Techniques · Reinforcement Learning in Robotics