Optical-Flow Guided Prompt Optimization for Coherent Video Generation

Hyelin Nam; Jaemin Kim; Dohun Lee; Jong Chul Ye

arXiv:2411.15540·cs.CV·March 25, 2025

Optical-Flow Guided Prompt Optimization for Coherent Video Generation

Hyelin Nam, Jaemin Kim, Dohun Lee, Jong Chul Ye

PDF

Open Access

TL;DR

This paper introduces MotionPrompt, a novel optical-flow guided prompt optimization framework that enhances temporal coherence in video generation by training a discriminator to guide learnable prompt tokens during diffusion sampling.

Contribution

We propose a new method that uses optical flow and a discriminator to optimize prompts, improving temporal consistency in diffusion-based video generation.

Findings

01

Generated videos exhibit improved temporal coherence.

02

Method maintains high visual fidelity.

03

Effective across multiple diffusion models.

Abstract

While text-to-video diffusion models have made significant strides, many still face challenges in generating videos with temporal consistency. Within diffusion frameworks, guidance techniques have proven effective in enhancing output quality during inference; however, applying these methods to video diffusion models introduces additional complexity of handling computations across entire sequences. To address this, we propose a novel framework called MotionPrompt that guides the video generation process via optical flow. Specifically, we train a discriminator to distinguish optical flow between random pairs of frames from real videos and generated ones. Given that prompts can influence the entire video, we optimize learnable token embeddings during reverse sampling steps by using gradients from a trained discriminator applied to random frame pairs. This approach allows our method to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Optical Coherence Tomography Applications · Advanced Vision and Imaging

MethodsDiffusion