Provably Efficient Representation Learning with Tractable Planning in   Low-Rank POMDP

Jiacheng Guo; Zihao Li; Huazheng Wang; Mengdi Wang; Zhuoran Yang,; Xuezhou Zhang

arXiv:2306.12356·cs.LG·June 22, 2023

Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP

Jiacheng Guo, Zihao Li, Huazheng Wang, Mengdi Wang, Zhuoran Yang,, Xuezhou Zhang

PDF

Open Access 1 Video

TL;DR

This paper introduces a computationally efficient algorithm for representation learning in certain classes of POMDPs, enabling more effective exploration and planning with provable guarantees.

Contribution

It presents the first efficient algorithm for representation learning in decodable and gamma-observable POMDPs that combines MLE and OFU, relying only on supervised learning oracles.

Findings

01

Achieves efficient sample complexity in decodable POMDPs.

02

Extends the algorithm to gamma-observable POMDPs.

03

Provides a practical approach for tractable planning in complex environments.

Abstract

In this paper, we study representation learning in partially observable Markov Decision Processes (POMDPs), where the agent learns a decoder function that maps a series of high-dimensional raw observations to a compact representation and uses it for more efficient exploration and planning. We focus our attention on the sub-classes of \textit{ $γ$ -observable} and \textit{decodable POMDPs}, for which it has been shown that statistically tractable learning is possible, but there has not been any computationally efficient algorithm. We first present an algorithm for decodable POMDPs that combines maximum likelihood estimation (MLE) and optimism in the face of uncertainty (OFU) to perform representation learning and achieve efficient sample complexity, while only calling supervised learning computational oracles. We then show how to adapt this algorithm to also work in the broader…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP· slideslive

Taxonomy

TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference

MethodsFocus