Robobarista: Object Part based Transfer of Manipulation Trajectories from Crowd-sourcing in 3D Pointclouds
Jaeyong Sung, Seok Hyun Jin, Ashutosh Saxena

TL;DR
Robobarista introduces a deep learning framework that leverages crowd-sourced 3D pointcloud data, language, and trajectories to enable robots to generalize manipulation skills across diverse household objects based on shared object parts.
Contribution
The paper presents a novel manipulation planning approach using structured prediction and multi-modal deep learning, along with a new crowd-sourcing platform for data collection.
Findings
Successfully learned manipulation trajectories from noisy crowd-sourced data.
Demonstrated generalization to unseen objects based on shared parts.
Achieved effective manipulation across 116 objects with 249 parts.
Abstract
There is a large variety of objects and appliances in human environments, such as stoves, coffee dispensers, juice extractors, and so on. It is challenging for a roboticist to program a robot for each of these object types and for each of their instantiations. In this work, we present a novel approach to manipulation planning based on the idea that many household objects share similarly-operated object parts. We formulate the manipulation planning as a structured prediction problem and design a deep learning model that can handle large noise in the manipulation demonstrations and learns features from three different modalities: point-clouds, language and trajectory. In order to collect a large number of manipulation demonstrations for different objects, we developed a new crowd-sourcing platform called Robobarista. We test our model on our dataset consisting of 116 objects with 249…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
