Contact-Safe Reinforcement Learning with ProMP Reparameterization and Energy Awareness
Bingkun Huang, Yuhe Gong, Zewen Yang, Tianyu Ren, Luis Figueredo

TL;DR
This paper introduces a novel contact-safe reinforcement learning framework that combines ProMP reparameterization with energy-aware control to improve safety, efficiency, and robustness in robotic manipulation tasks involving contact-rich environments.
Contribution
It proposes a task-space, energy-safe RL framework integrating PPO, movement primitives, and energy-aware impedance control for safer and more effective robotic manipulation.
Findings
Outperforms existing methods in success rate and safety.
Generates smooth, energy-efficient trajectories.
Effective in diverse 3D contact-rich tasks.
Abstract
Reinforcement learning (RL) approaches based on Markov Decision Processes (MDPs) are predominantly applied in the robot joint space, often relying on limited task-specific information and partial awareness of the 3D environment. In contrast, episodic RL has demonstrated advantages over traditional MDP-based methods in terms of trajectory consistency, task awareness, and overall performance in complex robotic tasks. Moreover, traditional step-wise and episodic RL methods often neglect the contact-rich information inherent in task-space manipulation, especially considering the contact-safety and robustness. In this work, contact-rich manipulation tasks are tackled using a task-space, energy-safe framework, where reliable and safe task-space trajectories are generated through the combination of Proximal Policy Optimization (PPO) and movement primitives. Furthermore, an energy-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Motor Control and Adaptation
