Dexterous Imitation Made Easy: A Learning-Based Framework for Efficient Dexterous Manipulation
Sridhar Pandian Arunachalam, Sneha Silwal, Ben Evans, Lerrel Pinto

TL;DR
This paper introduces DIME, a simple and efficient imitation learning framework that enables dexterous robotic manipulation using only a single RGB camera for demonstration collection, achieving complex in-hand tasks.
Contribution
DIME is a novel imitation learning framework that simplifies data collection and effectively trains dexterous manipulation policies with minimal equipment.
Findings
DIME successfully learns complex in-hand manipulation tasks.
The framework works on both simulation and real robots.
It requires only a single RGB camera for demonstration collection.
Abstract
Optimizing behaviors for dexterous manipulation has been a longstanding challenge in robotics, with a variety of methods from model-based control to model-free reinforcement learning having been previously explored in literature. Perhaps one of the most powerful techniques to learn complex manipulation strategies is imitation learning. However, collecting and learning from demonstrations in dexterous manipulation is quite challenging. The complex, high-dimensional action-space involved with multi-finger control often leads to poor sample efficiency of learning-based methods. In this work, we propose 'Dexterous Imitation Made Easy' (DIME) a new imitation learning framework for dexterous manipulation. DIME only requires a single RGB camera to observe a human operator and teleoperate our robotic hand. Once demonstrations are collected, DIME employs standard imitation learning methods to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Human Pose and Action Recognition
MethodsDistance to Modelled Embedding
