EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation

Yue Ma; Xu Ye; Qinghe Wang; Yucheng Wang; Hongyu Liu; Yinhan Zhang; Xinyu Wang; Yuanpeng Che; Shanhui Mo; Paul Liang; Fangneng Zhan; Qifeng Chen

arXiv:2605.22051·cs.CV·May 22, 2026

EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation

Yue Ma, Xu Ye, Qinghe Wang, Yucheng Wang, Hongyu Liu, Yinhan Zhang, Xinyu Wang, Yuanpeng Che, Shanhui Mo, Paul Liang, Fangneng Zhan, Qifeng Chen

PDF

TL;DR

EasyVFX introduces a frequency-domain decoupling framework that significantly reduces resource requirements for high-quality visual effects generation by separating spatial and temporal components.

Contribution

The paper presents a novel frequency-aware mixture-of-experts architecture and a test-time training strategy that enable efficient and adaptable VFX synthesis with limited data and computational resources.

Findings

01

Achieves realistic VFX with fewer GPU resources.

02

Enables rapid adaptation to unseen effects with minimal steps.

03

Produces visually consistent and high-quality effects.

Abstract

Generating high-fidelity visual effects (VFX) typically demands massive datasets and prohibitive computational power due to the intricate coupling of spatial textures and temporal dynamics. In this paper, we introduce EasyVFX, a resource-efficient framework that achieves realistic VFX synthesis under stringent constraints. Our core philosophy lies in frequency-domain decomposition: we observe that the complexity of VFX can be significantly mitigated by decoupling high-frequency components, which represent intricate spatial appearances, from low-frequency components that encapsulate global motion dynamics. This spectral disentanglement transforms a high-dimensional learning problem into manageable sub-tasks, thereby lowering the optimization barrier and reducing data dependency. Building upon this insight, we propose a two-stage training paradigm. First, we design a Frequency-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.