Success Probability of Exploration: a Concrete Analysis of Learning   Efficiency

Liangpeng Zhang; Ke Tang; Xin Yao

arXiv:1612.00882·cs.LG·December 6, 2016

Success Probability of Exploration: a Concrete Analysis of Learning Efficiency

Liangpeng Zhang, Ke Tang, Xin Yao

PDF

Open Access

TL;DR

This paper introduces a new analytical framework called the success probability of exploration in reinforcement learning, providing practical tools for evaluating exploration efficiency and predicting outcomes without running algorithms.

Contribution

It proposes a novel framework that answers key questions on exploration efficiency and offers a practical method for success probability evaluation in MDPs.

Findings

01

Framework effectively predicts exploration success probabilities

02

Empirical results validate the approach

03

Analysis helps understand exploration behaviors

Abstract

Exploration has been a crucial part of reinforcement learning, yet several important questions concerning exploration efficiency are still not answered satisfactorily by existing analytical frameworks. These questions include exploration parameter setting, situation analysis, and hardness of MDPs, all of which are unavoidable for practitioners. To bridge the gap between the theory and practice, we propose a new analytical framework called the success probability of exploration. We show that those important questions of exploration above can all be answered under our framework, and the answers provided by our framework meet the needs of practitioners better than the existing ones. More importantly, we introduce a concrete and practical approach to evaluating the success probabilities in certain MDPs without the need of actually running the learning algorithm. We then provide empirical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Machine Learning and Algorithms