Safe Learning for Uncertainty-Aware Planning via Interval MDP   Abstraction

Jesse Jiang; Ye Zhao; Samuel Coogan

arXiv:2202.01358·eess.SY·May 30, 2022

Safe Learning for Uncertainty-Aware Planning via Interval MDP Abstraction

Jesse Jiang, Ye Zhao, Samuel Coogan

PDF

TL;DR

This paper presents an abstraction-based method for safe, uncertainty-aware planning in stochastic systems, using Interval MDPs derived from Gaussian process regression to refine bounds and synthesize control policies.

Contribution

It introduces an iterative approach combining IMDP abstractions, path sampling, and heuristics to improve planning under uncertainty for systems with unknown dynamics.

Findings

01

Successfully applied to mobile robot navigation case study.

02

Achieved high-confidence bounds on system behavior.

03

Enhanced safety and performance in uncertain environments.

Abstract

We study the problem of refining satisfiability bounds for partially-known stochastic systems against planning specifications defined using syntactically co-safe Linear Temporal Logic (scLTL). We propose an abstraction-based approach that iteratively generates high-confidence Interval Markov Decision Process (IMDP) abstractions of the system from high-confidence bounds on the unknown component of the dynamics obtained via Gaussian process regression. In particular, we develop a synthesis strategy to sample the unknown dynamics by finding paths which avoid specification-violating states using a product IMDP. We further provide a heuristic to choose among various candidate paths to maximize the information gain. Finally, we propose an iterative algorithm to synthesize a satisfying control policy for the product IMDP system. We demonstrate our work with a case study on mobile robot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGaussian Process