Towards Online Safety Corrections for Robotic Manipulation Policies

Ariana Spalter; Mark Roberts; Laura M. Hiatt

arXiv:2409.08233·cs.RO·September 13, 2024

Open Access

TL;DR

This paper introduces iKinQP-RL, a hybrid method combining inverse kinematics quadratic programming with reinforcement learning to ensure robotic safety by preventing collisions with new obstacles during task execution.

Contribution

The paper proposes a novel hybrid approach that uses iKinQP to correct RL-predicted actions in real time, enhancing safety in robotic manipulation tasks.

Findings

01 Eliminates collisions with new obstacles during execution.

02 Maintains high task success rate.

03 Ensures safe operation in dynamic environments.

Abstract

Recent successes in applying reinforcement learning (RL) to robotics have shown that it is a viable approach for constructing robotic controllers. However, RL controllers can produce many collisions in environments where new obstacles appear during execution, which poses a problem in safety-critical settings. We present a hybrid approach, called iKinQP-RL, that uses an Inverse Kinematics Quadratic Programming (iKinQP) controller to correct actions proposed by an RL policy at runtime. This ensures safe execution in the presence of new obstacles not present during training. Preliminary experiments illustrate that our iKinQP-RL framework completely eliminates collisions with new obstacles while maintaining a high task success rate.
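
The idea of correcting a proposed action at runtime can be illustrated with a minimal sketch. This is not the authors' iKinQP formulation, which this page does not spell out. It assumes a hypothetical 2D point robot with velocity actions, one circular obstacle, and a single linear constraint that limits how fast the robot may approach the obstacle. With only one constraint, the quadratic program "find the velocity closest to the proposed one that satisfies the constraint" has a closed-form solution, so no solver is needed. The function name `correct_action` and the parameters `safe_distance` and `gain` are illustrative assumptions.

```python
import math


def correct_action(position, proposed_velocity, obstacle_center,
                   safe_distance, gain=1.0):
    """Return the velocity closest to proposed_velocity (in the Euclidean
    sense) that satisfies normal . u >= -gain * margin.

    Hypothetical single-constraint safety filter, not the paper's iKinQP.
    """
    # Vector from the obstacle to the robot, and its length.
    offset = [p - c for p, c in zip(position, obstacle_center)]
    dist = math.sqrt(sum(o * o for o in offset))
    if dist == 0.0:
        raise ValueError("position coincides with obstacle center")

    # Unit vector pointing away from the obstacle.
    normal = [o / dist for o in offset]

    # Remaining clearance before the safety distance is violated.
    margin = dist - safe_distance

    # Velocity component along the normal (negative means approaching).
    approach = sum(n * u for n, u in zip(normal, proposed_velocity))

    # The allowed approach speed shrinks as the clearance shrinks.
    lower_bound = -gain * margin
    if approach >= lower_bound:
        # The proposed action already satisfies the constraint.
        return list(proposed_velocity)

    # Project onto the constraint boundary: adjust only the normal component.
    shift = lower_bound - approach
    return [u + shift * n for u, n in zip(proposed_velocity, normal)]


if __name__ == "__main__":
    # Stand-in for an RL action that drives toward the obstacle.
    print(correct_action([1.0, 0.0], [-2.0, 1.0], [0.0, 0.0], 0.5))
```

In this sketch the component of the action tangential to the obstacle is left untouched, so the correction changes the proposed action as little as possible while keeping it inside the constraint. A manipulator version would need joint-space kinematics, multiple constraints, and a general QP solver.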

Taxonomy

Topics: Robot Manipulation and Learning · Safety Systems Engineering in Autonomy · Software Reliability and Analysis Research