Success Probability of Exploration: a Concrete Analysis of Learning Efficiency
Liangpeng Zhang, Ke Tang, Xin Yao

TL;DR
This paper introduces a new analytical framework called the success probability of exploration in reinforcement learning, providing practical tools for evaluating exploration efficiency and predicting outcomes without running algorithms.
Contribution
It proposes a novel framework that answers key questions on exploration efficiency and offers a practical method for success probability evaluation in MDPs.
Findings
Framework effectively predicts exploration success probabilities
Empirical results validate the approach
Analysis helps understand exploration behaviors
Abstract
Exploration has been a crucial part of reinforcement learning, yet several important questions concerning exploration efficiency are still not answered satisfactorily by existing analytical frameworks. These questions include exploration parameter setting, situation analysis, and hardness of MDPs, all of which are unavoidable for practitioners. To bridge the gap between the theory and practice, we propose a new analytical framework called the success probability of exploration. We show that those important questions of exploration above can all be answered under our framework, and the answers provided by our framework meet the needs of practitioners better than the existing ones. More importantly, we introduce a concrete and practical approach to evaluating the success probabilities in certain MDPs without the need of actually running the learning algorithm. We then provide empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Machine Learning and Algorithms
