Learning Gentle Object Manipulation with Curiosity-Driven Deep   Reinforcement Learning

Sandy H. Huang; Martina Zambelli; Jackie Kay; Murilo F. Martins; Yuval; Tassa; Patrick M. Pilarski; Raia Hadsell

arXiv:1903.08542·cs.RO·March 21, 2019·45 cites

Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning

Sandy H. Huang, Martina Zambelli, Jackie Kay, Murilo F. Martins, Yuval, Tassa, Patrick M. Pilarski, Raia Hadsell

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach that encourages gentle robot manipulation by combining penalty-based surprise intrinsic rewards with task rewards, leading to safer and more effective contact-rich interactions.

Contribution

It demonstrates that penalty-based surprise intrinsic rewards improve learning stability and exploration in gentle manipulation tasks, outperforming simple dynamics-based surprise.

Findings

01

Penalty-based surprise enhances exploration and safety.

02

The approach works on a complex tendon-powered robot hand.

03

Videos demonstrate successful gentle manipulation.

Abstract

Robots must know how to be gentle when they need to interact with fragile objects, or when the robot itself is prone to wear and tear. We propose an approach that enables deep reinforcement learning to train policies that are gentle, both during exploration and task execution. In a reward-based learning environment, a natural approach involves augmenting the (task) reward with a penalty for non-gentleness, which can be defined as excessive impact force. However, augmenting with only this penalty impairs learning: policies get stuck in a local optimum which avoids all contact with the environment. Prior research has shown that combining auxiliary tasks or intrinsic rewards can be beneficial for stabilizing and accelerating learning in sparse-reward domains, and indeed we find that introducing a surprise-based intrinsic reward does avoid the no-contact failure case. However, we show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Neural dynamics and brain function