Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes

Min Zhang; Hongyao Tang; Jianye Hao; Yan Zheng

arXiv:2209.07696 · cs.LG · September 19, 2022 · 1 cites

TL;DR

This paper introduces a unified theory of policy abstraction and a deep metric learning approach for policy representation in Markov Decision Processes, addressing challenges of large policy spaces and improving policy evaluation and optimization.

Contribution

It proposes the first unified policy abstraction theory with three abstraction types and corresponding metrics, along with a deep metric learning method for policy representation.

Findings

01 Policy abstraction influences downstream learning performance.

02 Influence-irrelevance abstraction is generally preferred.

03 Proposed metrics effectively characterize policy differences.

Abstract

Lying at the heart of intelligent decision-making systems, how policy is represented and optimized is a fundamental problem. The root challenge is the large scale and high complexity of the policy space, which exacerbates the difficulty of policy learning, especially in real-world scenarios. Towards a desirable surrogate policy space, policy representation in a low-dimensional latent space has recently shown its potential in improving both the evaluation and optimization of policies. The key question in these studies is by what criterion we should abstract the policy space for the desired compression and generalization. However, both the theory of policy abstraction and the methodology of policy representation learning are less studied in the literature. In this work, we make the very first efforts to fill this gap. First, we propose a unified policy abstraction…

Taxonomy

Topics: Smart Cities and Technologies · Fuel Cells and Related Materials