Kernel-based diffusion approximated Markov decision processes for autonomous navigation and control on unstructured terrains
Junhong Xu, Kai Yin, Zheng Chen, Jason M. Gregory, Ethan A. Stump,, Lantao Liu

TL;DR
This paper introduces a diffusion approximation method for continuous-state MDPs, enabling autonomous navigation in unstructured terrains without requiring fully known transition models, validated through simulations and real-world experiments.
Contribution
It presents a novel kernel-based diffusion approximation approach that relaxes the assumption of known transition models in MDPs for autonomous navigation.
Findings
Superior performance in obstacle avoidance and terrain navigation in simulations
Effective real-world deployment in indoor and outdoor environments
Efficient policy iteration using a kernel representation of the value function
Abstract
We propose a diffusion approximation method to the continuous-state Markov Decision Processes (MDPs) that can be utilized to address autonomous navigation and control in unstructured off-road environments. In contrast to most decision-theoretic planning frameworks that assume fully known state transition models, we design a method that eliminates such a strong assumption that is often extremely difficult to engineer in reality. We first take the second-order Taylor expansion of the value function. The Bellman optimality equation is then approximated by a partial differential equation, which only relies on the first and second moments of the transition model. By combining the kernel representation of the value function, we design an efficient policy iteration algorithm whose policy evaluation step can be represented as a linear system of equations characterized by a finite set of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Evacuation and Crowd Dynamics
