Stable In-hand Manipulation with Finger Specific Multi-agent Shadow Reward

Lingfeng Tao; Jiucai Zhang; Xiaoli Zhang

arXiv:2309.07349·cs.RO·September 15, 2023

TL;DR

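This paper introduces the Finger-specific Multi-agent Shadow Reward (FMSR), a dense reward method for stable in-hand manipulation with multi-agent deep reinforcement learning. Combined with information sharing (IS) among agents, it is reported to converge faster than traditional methods and to produce more stable manipulation than sparse rewards.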

Contribution

FMSR expresses stable-manipulation constraints as dense rewards derived from the state-action occupancy measure, and is complemented by information sharing (IS) among agents.
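To make the idea of a reward derived from the state-action occupancy measure concrete, here is a minimal sketch. It is not the paper's implementation: the excerpt does not give FMSR's utility function or its approximation scheme. The sketch assumes discrete, hashable states and actions, estimates the discounted occupancy from sampled trajectories, and uses a hypothetical quadratic utility F(λ) = -½ Σ (λ - λ*)², whose gradient with respect to the occupancy plays the role of a shadow reward. The function names `empirical_occupancy` and `shadow_reward` and the target occupancy λ* are illustrative assumptions.

```python
from collections import Counter


def empirical_occupancy(trajectories, gamma=0.99):
    """Estimate a normalized discounted state-action occupancy measure.

    trajectories: list of trajectories, each a list of (state, action)
    pairs with hashable entries (an assumption of this sketch).
    Returns a dict mapping (state, action) -> weight, summing to 1.
    """
    weights = Counter()
    for traj in trajectories:
        for t, (state, action) in enumerate(traj):
            # Each visit at step t contributes a discounted weight gamma^t.
            weights[(state, action)] += gamma ** t
    total = sum(weights.values())
    return {sa: w / total for sa, w in weights.items()}


def shadow_reward(occupancy, target):
    """Gradient of the hypothetical utility F = -0.5 * sum((occ - target)^2).

    dF/d occ(s, a) = target(s, a) - occ(s, a), so pairs visited less often
    than the target get a positive reward, over-visited pairs a negative one.
    """
    keys = set(occupancy) | set(target)
    return {sa: target.get(sa, 0.0) - occupancy.get(sa, 0.0) for sa in keys}


# Toy usage: one two-step trajectory and a made-up target occupancy.
occ = empirical_occupancy([[("s0", "a0"), ("s1", "a1")]], gamma=0.5)
rewards = shadow_reward(occ, {("s0", "a0"): 1.0})
```

Because the reward is defined for every state-action pair rather than only at task completion, it is dense by construction; how FMSR assigns such rewards per finger is not covered by this sketch.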

Findings

01. FMSR+IS converges faster than traditional methods.

02. FMSR+IS achieves higher success rates and stability.

03. Dense rewards improve manipulation stability over sparse rewards.

Abstract

Deep Reinforcement Learning (DRL) has shown its capability to handle the high degrees of freedom in control and the complex object interactions of multi-finger dexterous in-hand manipulation tasks. Current DRL approaches prefer sparse rewards to dense rewards for ease of training, but sparse rewards lack behavior constraints during the manipulation process, leading to aggressive and unstable policies that are inadequate for safety-critical in-hand manipulation tasks. Dense rewards can regulate the policy to learn stable manipulation behaviors through continuous reward constraints, but they are hard to define empirically and slow to converge. This work proposes the Finger-specific Multi-agent Shadow Reward (FMSR) method to determine the stable manipulation constraints in the form of dense reward based on the state-action occupancy measure, a general utility of DRL that is approximated during…

Taxonomy

Topics: Muscle activation and electromyography studies · Motor Control and Adaptation · Robot Manipulation and Learning