Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang,, Stephen Marcus McAleer, Yiran Geng, Hao Dong, Zongqing Lu, Song-Chun Zhu,, Yaodong Yang

TL;DR
This paper introduces Bi-DexHands, a high-fidelity simulator for bimanual dexterous manipulation tasks, and benchmarks various RL algorithms, revealing strengths and limitations in achieving human-level robotic dexterity.
Contribution
The study presents a new simulator for bimanual manipulation, comprehensive RL benchmarks, and insights into the current capabilities and challenges of RL in dexterous robotic tasks.
Findings
PPO algorithms master simple tasks like catching and opening bottles.
Multi-agent RL improves bimanual cooperation skills.
Existing RL algorithms struggle with multi-task and few-shot learning in dexterous manipulation.
Abstract
Achieving human-level dexterity is an important open problem in robotics. However, tasks of dexterous hand manipulation, even at the baby level, are challenging to solve through reinforcement learning (RL). The difficulty lies in the high degrees of freedom and the required cooperation among heterogeneous agents (e.g., joints of fingers). In this study, we propose the Bimanual Dexterous Hands Benchmark (Bi-DexHands), a simulator that involves two dexterous hands with tens of bimanual manipulation tasks and thousands of target objects. Specifically, tasks in Bi-DexHands are designed to match different levels of human motor skills according to cognitive science literature. We built Bi-DexHands in the Issac Gym; this enables highly efficient RL training, reaching 30,000+ FPS by only one single NVIDIA RTX 3090. We provide a comprehensive benchmark for popular RL algorithms under different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Robot Manipulation and Learning
MethodsEntropy Regularization · Proximal Policy Optimization
