Conformal Prediction Intervals for Markov Decision Process Trajectories

Thomas G. Dietterich; Jesse Hostetler

arXiv:2206.04860·cs.LG·June 23, 2022·6 cites

Conformal Prediction Intervals for Markov Decision Process Trajectories

Thomas G. Dietterich, Jesse Hostetler

PDF

Open Access

TL;DR

This paper develops conformal prediction intervals for future trajectories of autonomous systems in Markov Decision Processes, providing probabilistic guarantees on their behavior for safety and reliability.

Contribution

It extends conformal prediction methods to MDP trajectories, combining quantile regression with conformal corrections for reliable future behavior intervals.

Findings

01

Guarantees that observed trajectories lie within intervals with probability 1-δ

02

Applied method to invasive species management MDPs

03

Demonstrated approach on StarCraft2 battles

Abstract

Before delegating a task to an autonomous system, a human operator may want a guarantee about the behavior of the system. This paper extends previous work on conformal prediction for functional data and conformalized quantile regression to provide conformal prediction intervals over the future behavior of an autonomous system executing a fixed control policy on a Markov Decision Process (MDP). The prediction intervals are constructed by applying conformal corrections to prediction intervals computed by quantile regression. The resulting intervals guarantee that with probability $1 - δ$ the observed trajectory will lie inside the prediction interval, where the probability is computed with respect to the starting state distribution and the stochasticity of the MDP. The method is illustrated on MDPs for invasive species management and StarCraft2 battles.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Mass Spectrometry Techniques and Applications · Markov Chains and Monte Carlo Methods