FlowBot3D: Learning 3D Articulation Flow to Manipulate Articulated Objects
Ben Eisner, Harry Zhang, David Held

TL;DR
FlowBot3D introduces a vision-based neural network that predicts 3D articulation flows, enabling robots to manipulate unseen articulated objects with high accuracy in both simulation and real-world scenarios.
Contribution
A novel neural network predicts dense motion vector fields for 3D articulated objects, generalizing to unseen classes without fine-tuning.
Findings
Achieves state-of-the-art performance in simulation and real-world tests.
Successfully generalizes to unseen object instances and categories.
Operates effectively on a real robot without additional training.
Abstract
We explore a novel method to perceive and manipulate 3D articulated objects that generalizes to enable a robot to articulate unseen classes of objects. We propose a vision-based system that learns to predict the potential motions of the parts of a variety of articulated objects to guide downstream motion planning of the system to articulate the objects. To predict the object motions, we train a neural network to output a dense vector field representing the point-wise motion direction of the points in the point cloud under articulation. We then deploy an analytical motion planner based on this vector field to achieve a policy that yields maximum articulation. We train the vision system entirely in simulation, and we demonstrate the capability of our system to generalize to unseen object instances and novel categories in both simulation and the real world, deploying our policy on a Sawyer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Multimodal Machine Learning Applications
