MP1: MeanFlow Tames Policy Learning in 1-step for Robotic Manipulation

Juyi Sheng; Ziyi Wang; Peiming Li; Mengyuan Liu

arXiv:2507.10543·cs.RO·December 4, 2025

MP1: MeanFlow Tames Policy Learning in 1-step for Robotic Manipulation

Juyi Sheng, Ziyi Wang, Peiming Li, Mengyuan Liu

PDF

1 Video

TL;DR

MP1 introduces a novel policy learning method for robotic manipulation that generates action trajectories in one network evaluation, improving accuracy and speed while enhancing generalization and controllability.

Contribution

The paper proposes MP1, a new approach combining MeanFlow with 3D point-cloud inputs for one-step trajectory generation without consistency constraints, improving efficiency and accuracy in robot learning.

Findings

01

Outperforms state-of-the-art methods on benchmarks

02

Achieves 10.2% higher success rates than DP3

03

Inference is 19x faster than DP3 and nearly 2x faster than FlowPolicy

Abstract

In robot manipulation, robot learning has become a prevailing approach. However, generative models within this field face a fundamental trade-off between the slow, iterative sampling of diffusion models and the architectural constraints of faster Flow-based methods, which often rely on explicit consistency losses. To address these limitations, we introduce MP1, which pairs 3D point-cloud inputs with the MeanFlow paradigm to generate action trajectories in one network function evaluation (1-NFE). By directly learning the interval-averaged velocity via the "MeanFlow Identity", our policy avoids any additional consistency constraints. This formulation eliminates numerical ODE-solver errors during inference, yielding more precise trajectories. MP1 further incorporates CFG for improved trajectory controllability while retaining 1-NFE inference without reintroducing structural constraints.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MP1: MeanFlow Tames Policy Learning in 1-step for Robotic Manipulation· underline