FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency
Yifei Su, Ning Liu, Dong Chen, Zhen Zhao, Kun Wu, Meng Li, Zhiyuan Xu, Zhengping Che, Jian Tang

TL;DR
FreqPolicy introduces a frequency consistency constraint to flow-based visuomotor policies, enabling efficient, temporally coherent action generation suitable for real-time robotic manipulation and outperforming existing methods.
Contribution
It proposes a novel frequency consistency constraint for flow-based policies, improving temporal coherence and inference efficiency in robotic visuomotor tasks.
Findings
Outperforms existing one-step action generators on 53 simulation tasks.
Achieves acceleration in vision-language-action models without performance loss.
Operates at 93.5 Hz inference frequency in real-world scenarios.
Abstract
Generative modeling-based visuomotor policies have been widely adopted in robotic manipulation, attributed to their ability to model multimodal action distributions. However, the high inference cost of multi-step sampling limits its applicability in real-time robotic systems. Existing approaches accelerate sampling in generative modeling-based visuomotor policies by adapting techniques originally developed to speed up image generation. However, a major distinction exists: image generation typically produces independent samples without temporal dependencies, while robotic manipulation requires generating action trajectories with continuity and temporal coherence. To this end, we propose FreqPolicy, a novel approach that first imposes frequency consistency constraints on flow-based visuomotor policies. Our work enables the action model to capture temporal structure effectively while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRobot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics
