DemoGrasp: Few-Shot Learning for Robotic Grasping with Human Demonstration
Pengyuan Wang, Fabian Manhardt, Luca Minciullo, Lorenzo Garattoni,, Sven Meie, Nassir Navab, Benjamin Busam

TL;DR
This paper introduces DemoGrasp, a few-shot learning method that enables robots to grasp objects using minimal human demonstration, overcoming the need for large datasets or specific geometries, and demonstrating effectiveness in real and synthetic environments.
Contribution
DemoGrasp is a novel approach that leverages a short human demonstration to teach robots grasping, eliminating the need for extensive annotations or geometric restrictions.
Findings
Effective in real and synthetic environments
Outperforms previous methods in generalization
Requires minimal human demonstration
Abstract
The ability to successfully grasp objects is crucial in robotics, as it enables several interactive downstream applications. To this end, most approaches either compute the full 6D pose for the object of interest or learn to predict a set of grasping points. While the former approaches do not scale well to multiple object instances or classes yet, the latter require large annotated datasets and are hampered by their poor generalization capabilities to new geometries. To overcome these shortcomings, we propose to teach a robot how to grasp an object with a simple and short human demonstration. Hence, our approach neither requires many annotated images nor is it restricted to a specific geometry. We first present a small sequence of RGB-D images displaying a human-object interaction. This sequence is then leveraged to build associated hand and object meshes that represent the depicted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition
