3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation
Nikolaos Gkanatsios, Jiahe Xu, Matthew Bronars, Arsalan Mousavian, Tsung-Wei Ke, Katerina Fragkiadaki

TL;DR
The paper introduces 3D FlowMatch Actor (3DFA), a fast and effective 3D policy architecture for robot manipulation that combines flow matching with 3D visual representations, achieving state-of-the-art results in various benchmarks.
Contribution
It presents a novel 3D policy architecture that integrates flow matching with 3D visual scene representations, significantly improving training speed and performance over prior diffusion-based methods.
Findings
Achieves over 30x faster training and inference than previous methods.
Sets new state-of-the-art on the PerAct2 benchmark with 41.4% margin.
Outperforms baselines on 74 RLBench tasks with fewer assumptions.
Abstract
We present 3D FlowMatch Actor (3DFA), a 3D policy architecture for robot manipulation that combines flow matching for trajectory prediction with 3D pretrained visual scene representations for learning from demonstration. 3DFA leverages 3D relative attention between action and visual tokens during action denoising, building on prior work in 3D diffusion-based single-arm policy learning. Through a combination of flow matching and targeted system-level and architectural optimizations, 3DFA achieves over 30x faster training and inference than previous 3D diffusion-based policies, without sacrificing performance. On the bimanual PerAct2 benchmark, it establishes a new state of the art, outperforming the next-best method by an absolute margin of 41.4%. In extensive real-world evaluations, it surpasses strong baselines with up to 1000x more parameters and significantly more pretraining. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
