Kernel Taylor-Based Value Function Approximation for Continuous-State   Markov Decision Processes

Junhong Xu; Kai Yin; Lantao Liu

arXiv:2006.02008·cs.RO·June 4, 2020

Kernel Taylor-Based Value Function Approximation for Continuous-State Markov Decision Processes

Junhong Xu, Kai Yin, Lantao Liu

PDF

Open Access

TL;DR

This paper introduces a kernel-based policy iteration method for continuous-state MDPs that does not require known transition models, using Taylor expansion and PDE approximation to improve planning efficiency.

Contribution

It presents a novel kernel Taylor-based approach that eliminates the need for explicit transition models in continuous-state MDPs, enabling more practical policy iteration.

Findings

01

Outperforms baseline methods in simulations

02

Efficient policy evaluation via linear systems

03

Effective in both simplified and realistic scenarios

Abstract

We propose a principled kernel-based policy iteration algorithm to solve the continuous-state Markov Decision Processes (MDPs). In contrast to most decision-theoretic planning frameworks, which assume fully known state transition models, we design a method that eliminates such a strong assumption, which is oftentimes extremely difficult to engineer in reality. To achieve this, we first apply the second-order Taylor expansion of the value function. The Bellman optimality equation is then approximated by a partial differential equation, which only relies on the first and second moments of the transition model. By combining the kernel representation of value function, we then design an efficient policy iteration algorithm whose policy evaluation step can be represented as a linear system of equations characterized by a finite set of supporting states. We have validated the proposed method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Formal Methods in Verification · Simulation Techniques and Applications