Stable In-hand Manipulation with Finger Specific Multi-agent Shadow Reward
Lingfeng Tao, Jiucai Zhang, Xiaoli Zhang

TL;DR
This paper introduces FMSR, a dense reward method for stable in-hand manipulation with multi-agent reinforcement learning, improving convergence speed and stability over traditional sparse rewards.
Contribution
The novel FMSR method determines stable manipulation constraints via dense rewards based on state-action occupancy, enhanced by information sharing among agents.
Findings
FMSR+IS converges faster than traditional methods.
FMSR+IS achieves higher success rates and stability.
Dense rewards improve manipulation stability over sparse rewards.
Abstract
Deep Reinforcement Learning has shown its capability to solve the high degrees of freedom in control and the complex interaction with the object in the multi-finger dexterous in-hand manipulation tasks. Current DRL approaches prefer sparse rewards to dense rewards for the ease of training but lack behavior constraints during the manipulation process, leading to aggressive and unstable policies that are insufficient for safety-critical in-hand manipulation tasks. Dense rewards can regulate the policy to learn stable manipulation behaviors with continuous reward constraints but are hard to empirically define and slow to converge optimally. This work proposes the Finger-specific Multi-agent Shadow Reward (FMSR) method to determine the stable manipulation constraints in the form of dense reward based on the state-action occupancy measure, a general utility of DRL that is approximated during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMuscle activation and electromyography studies · Motor Control and Adaptation · Robot Manipulation and Learning
