Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
Min Zhang, Hongyao Tang, Jianye Hao, Yan Zheng

TL;DR
This paper introduces a unified theory of policy abstraction and a deep metric learning approach for policy representation in Markov Decision Processes, addressing challenges of large policy spaces and improving policy evaluation and optimization.
Contribution
It proposes the first unified policy abstraction theory with three abstraction types and corresponding metrics, along with a deep metric learning method for policy representation.
Findings
Policy abstraction influences downstream learning performance.
Influence-irrelevance abstraction is generally preferred.
Proposed metrics effectively characterize policy differences.
Abstract
Lying on the heart of intelligent decision-making systems, how policy is represented and optimized is a fundamental problem. The root challenge in this problem is the large scale and the high complexity of policy space, which exacerbates the difficulty of policy learning especially in real-world scenarios. Towards a desirable surrogate policy space, recently policy representation in a low-dimensional latent space has shown its potential in improving both the evaluation and optimization of policy. The key question involved in these studies is by what criterion we should abstract the policy space for desired compression and generalization. However, both the theory on policy abstraction and the methodology on policy representation learning are less studied in the literature. In this work, we make very first efforts to fill up the vacancy. First, we propose a unified policy abstraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Cities and Technologies · Fuel Cells and Related Materials
