Guarantees on Robot System Performance Using Stochastic Simulation Rollouts
Joseph A. Vincent, Aaron O. Feldman, Mac Schwager

TL;DR
This paper introduces a method to provide finite-sample safety and performance guarantees for stochastic robotic control policies through simulation rollouts, applicable to various policies and robust to distribution shifts.
Contribution
It offers a novel, distribution-agnostic approach to certify safety and performance of robotic policies using only simulation data, including robustness to distribution shifts and multi-hypothesis policy selection.
Findings
Validates bounds in MuJoCo environments including Ant and Half-cheetah.
Demonstrates safety verification on MuJoCo's Ant robot.
Shows importance of multi-hypothesis correction for complex systems.
Abstract
We provide finite-sample performance guarantees for control policies executed on stochastic robotic systems. Given an open- or closed-loop policy and a finite set of trajectory rollouts under the policy, we bound the expected value, value-at-risk, and conditional-value-at-risk of the trajectory cost, and the probability of failure in a sparse cost setting. The bounds hold, with user-specified probability, for any policy synthesis technique and can be seen as a post-design safety certification. Generating the bounds only requires sampling simulation rollouts, without assumptions on the distribution or complexity of the underlying stochastic system. We adapt these bounds to also give a constraint satisfaction test to verify safety of the robot system. We provide a thorough analysis of the bound sensitivity to sim-to-real distribution shifts and provide results for constructing robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Formal Methods in Verification · Bayesian Modeling and Causal Inference
