FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens

Yiming Zhong; Yumeng Liu; Chuyang Xiao; Zemin Yang; Youzhuo Wang; Yufei Zhu; Ye Shi; Yujing Sun; Xinge Zhu; Yuexin Ma

arXiv:2506.01583·cs.RO·October 7, 2025

FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens

Yiming Zhong, Yumeng Liu, Chuyang Xiao, Zemin Yang, Youzhuo Wang, Yufei Zhu, Ye Shi, Yujing Sun, Xinge Zhu, Yuexin Ma

PDF

Open Access

TL;DR

FreqPolicy introduces a hierarchical frequency-domain autoregressive approach with continuous tokens for visuomotor policy learning, improving robotic manipulation accuracy and efficiency by capturing motion structure effectively.

Contribution

It proposes a novel frequency-based hierarchical modeling paradigm with continuous latent representations for visuomotor policies, advancing beyond existing methods.

Findings

01

Outperforms existing methods in accuracy and efficiency

02

Effective modeling of motion structure via frequency components

03

Demonstrates generalization across diverse robotic tasks

Abstract

Learning effective visuomotor policies for robotic manipulation is challenging, as it requires generating precise actions while maintaining computational efficiency. Existing methods remain unsatisfactory due to inherent limitations in the essential action representation and the basic network architectures. We observe that representing actions in the frequency domain captures the structured nature of motion more effectively: low-frequency components reflect global movement patterns, while high-frequency components encode fine local details. Additionally, robotic manipulation tasks of varying complexity demand different levels of modeling precision across these frequency bands. Motivated by this, we propose a novel paradigm for visuomotor policy learning that progressively models hierarchical frequency components. To further enhance precision, we introduce continuous latent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural dynamics and brain function · Visual perception and processing mechanisms · Neural Networks and Reservoir Computing