Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making
Qinghua Liu, Praneeth Netrapalli, Csaba Szepesv\'ari, Chi Jin

TL;DR
This paper presents OMLE, a unified, efficient model-based algorithm that learns near-optimal policies in a broad class of partially observable sequential decision-making problems, including new challenging settings.
Contribution
Introduces OMLE, a generic algorithm combining optimism and maximum likelihood estimation, capable of handling diverse POMDPs and partially observable RL problems with polynomial sample complexity.
Findings
Learns near-optimal policies in various POMDPs and RL problems.
First sample complexity for observable POMDPs with continuous observations.
Effective in low-rank and SAIL condition problems.
Abstract
This paper introduces a simple efficient learning algorithms for general sequential decision making. The algorithm combines Optimism for exploration with Maximum Likelihood Estimation for model estimation, which is thus named OMLE. We prove that OMLE learns the near-optimal policies of an enormously rich class of sequential decision making problems in a polynomial number of samples. This rich class includes not only a majority of known tractable model-based Reinforcement Learning (RL) problems (such as tabular MDPs, factored MDPs, low witness rank problems, tabular weakly-revealing/observable POMDPs and multi-step decodable POMDPs), but also many new challenging RL problems especially in the partially observable setting that were not previously known to be tractable. Notably, the new problems addressed by this paper include (1) observable POMDPs with continuous observation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Supply Chain and Inventory Management · Auction Theory and Applications
