To Do or Not to Do: Ensuring the Safety of Visuomotor Policies Learned from Demonstrations
Riad Ahmed, Moniruzzaman Akash, and Momotaz Begum

TL;DR
This paper introduces a safety guarantee method for visuomotor imitation learning policies, ensuring maximum task success within a safe state region, validated through experiments on a Franka robot.
Contribution
It proposes a novel execution guarantee measure using view synthesis and set invariance principles to enhance safety in IL policies.
Findings
Guarantees maximum task success despite minor runtime changes.
Uses view synthesis to identify safe state regions.
Recovery policy improves performance and safety tradeoff.
Abstract
Task success has historically been the primary measure of policy performance in imitation learning (IL) research. This characteristics strictly limits the ubiquitous applications of IL algorithms in field robotics where safety assurance, in addition to task-success, is of paramount importance. It is often desirable for an IL-powered robot in the field not to roll out a policy, and hence score a poor performance, if the safety is not guaranteed. Although this trade-off between safety and performance is well investigated in classical control literature, policy safety is a heavily underexplored domain in IL research. There is no universal definition of safety in IL. To make things worst, many existing theoretical works on safety is notoriously difficult to extend to IL-powered robots in the field. This paper offers important insights on the safety and performance of IL policies. We propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
