Tuning-free Visual Effect Transfer across Videos

Maxwell Jones; Rameen Abdal; Or Patashnik; Ruslan Salakhutdinov; Sergey Tulyakov; Jun-Yan Zhu; Kuan-Chieh Jackson Wang

arXiv:2601.07833·cs.CV·February 20, 2026

Tuning-free Visual Effect Transfer across Videos

Maxwell Jones, Rameen Abdal, Or Patashnik, Ruslan Salakhutdinov, Sergey Tulyakov, Jun-Yan Zhu, Kuan-Chieh Jackson Wang

PDF

Open Access

TL;DR

RefVFX introduces a scalable, tuning-free framework for transferring complex temporal visual effects from reference videos to target videos or images, enabling consistent and coherent edits without extensive retraining.

Contribution

The paper presents a novel dataset creation pipeline and a reference-conditioned model that effectively transfer dynamic effects across videos without the need for tuning or prompt engineering.

Findings

01

Outperforms prompt-only baselines in quality and coherence

02

Generalizes well to unseen effect categories

03

Produces visually consistent, temporally coherent edits

Abstract

We present RefVFX, a new framework that transfers complex temporal effects from a reference video onto a target video or image in a feed-forward manner. While existing methods excel at prompt-based or keyframe-conditioned editing, they struggle with dynamic temporal effects such as dynamic lighting changes or character transformations, which are difficult to describe via text or static conditions. Transferring a video effect is challenging, as the model must integrate the new temporal dynamics with the input video's existing motion and appearance. % To address this, we introduce a large-scale dataset of triplets, where each triplet consists of a reference effect video, an input image or video, and a corresponding output video depicting the transferred effect. Creating this data is non-trivial, especially the video-to-video effect triplets, which do not exist naturally. To generate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Vision and Imaging