Kernel-based diffusion approximated Markov decision processes for   autonomous navigation and control on unstructured terrains

Junhong Xu; Kai Yin; Zheng Chen; Jason M. Gregory; Ethan A. Stump,; Lantao Liu

arXiv:2111.08748·cs.RO·February 8, 2024

Kernel-based diffusion approximated Markov decision processes for autonomous navigation and control on unstructured terrains

Junhong Xu, Kai Yin, Zheng Chen, Jason M. Gregory, Ethan A. Stump,, Lantao Liu

PDF

Open Access

TL;DR

This paper introduces a diffusion approximation method for continuous-state MDPs, enabling autonomous navigation in unstructured terrains without requiring fully known transition models, validated through simulations and real-world experiments.

Contribution

It presents a novel kernel-based diffusion approximation approach that relaxes the assumption of known transition models in MDPs for autonomous navigation.

Findings

01

Superior performance in obstacle avoidance and terrain navigation in simulations

02

Effective real-world deployment in indoor and outdoor environments

03

Efficient policy iteration using a kernel representation of the value function

Abstract

We propose a diffusion approximation method to the continuous-state Markov Decision Processes (MDPs) that can be utilized to address autonomous navigation and control in unstructured off-road environments. In contrast to most decision-theoretic planning frameworks that assume fully known state transition models, we design a method that eliminates such a strong assumption that is often extremely difficult to engineer in reality. We first take the second-order Taylor expansion of the value function. The Bellman optimality equation is then approximated by a partial differential equation, which only relies on the first and second moments of the transition model. By combining the kernel representation of the value function, we design an efficient policy iteration algorithm whose policy evaluation step can be represented as a linear system of equations characterized by a finite set of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Evacuation and Crowd Dynamics