HybridFlow: A Two-Step Generative Policy for Robotic Manipulation
Zhenchen Dong, Jinna Fu, Jiaming Wu, Shengyuan Yu, Fulin Chen, Yide Liu

TL;DR
HybridFlow is a novel three-stage, low-latency robotic manipulation policy that balances speed and precision, significantly outperforming existing diffusion-based methods in success rate and inference time.
Contribution
It introduces HybridFlow, a three-stage generative policy combining MeanFlow and ReFlow modes, achieving fast and precise robot actions with minimal inference steps.
Findings
Outperforms 16-step Diffusion Policy by 15-25% success rate
Reduces inference time from 152ms to 19ms (8x speedup)
Achieves 70.0% success on unseen-color OOD grasping
Abstract
Limited by inference latency, existing robot manipulation policies lack sufficient real-time interaction capability with the environment. Although faster generation methods such as flow matching are gradually replacing diffusion methods, researchers are pursuing even faster generation suitable for interactive robot control. MeanFlow, as a one-step variant of flow matching, has shown strong potential in image generation, but its precision in action generation does not meet the stringent requirements of robotic manipulation. We therefore propose \textbf{HybridFlow}, a \textbf{3-stage method} with \textbf{2-NFE}: Global Jump in MeanFlow mode, ReNoise for distribution alignment, and Local Refine in ReFlow mode. This method balances inference speed and generation quality by leveraging the rapid advantage of MeanFlow one-step generation while ensuring action precision with minimal generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis · Soft Robotics and Applications
