TL;DR
MP1 introduces a novel policy learning method for robotic manipulation that generates action trajectories in one network evaluation, improving accuracy and speed while enhancing generalization and controllability.
Contribution
The paper proposes MP1, a new approach combining MeanFlow with 3D point-cloud inputs for one-step trajectory generation without consistency constraints, improving efficiency and accuracy in robot learning.
Findings
Outperforms state-of-the-art methods on benchmarks
Achieves 10.2% higher success rates than DP3
Inference is 19x faster than DP3 and nearly 2x faster than FlowPolicy
Abstract
In robot manipulation, robot learning has become a prevailing approach. However, generative models within this field face a fundamental trade-off between the slow, iterative sampling of diffusion models and the architectural constraints of faster Flow-based methods, which often rely on explicit consistency losses. To address these limitations, we introduce MP1, which pairs 3D point-cloud inputs with the MeanFlow paradigm to generate action trajectories in one network function evaluation (1-NFE). By directly learning the interval-averaged velocity via the "MeanFlow Identity", our policy avoids any additional consistency constraints. This formulation eliminates numerical ODE-solver errors during inference, yielding more precise trajectories. MP1 further incorporates CFG for improved trajectory controllability while retaining 1-NFE inference without reintroducing structural constraints.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
