OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Jiacheng Zhang, Jie Wu, Weifeng Chen, Yatai Ji, Xuefeng Xiao, Weilin Huang, Kai Han

TL;DR
OnlineVPO introduces a video-specific preference learning framework for video diffusion models, leveraging video quality assessment models for better human-aligned feedback and an online DPO algorithm for scalable, high-quality video generation.
Contribution
The paper presents a novel preference learning framework tailored for VDMs, utilizing VQA models for feedback and an online DPO algorithm for improved scalability and optimization.
Findings
VQA models outperform image-level reward models in aligning with human video preferences.
OnlineVPO achieves higher resolution and longer video generation with better scalability.
Extensive experiments validate the effectiveness and scalability of the proposed method.
Abstract
Video diffusion models (VDMs) have demonstrated remarkable capabilities in text-to-video (T2V) generation. Despite their success, VDMs still suffer from degraded image quality and flickering artifacts. To address these issues, some approaches have introduced preference learning to exploit human feedback to enhance the video generation. However, these methods primarily adopt the routine in the image domain without an in-depth investigation into video-specific preference optimization. In this paper, we reexamine the design of the video preference learning from two key aspects: feedback source and feedback tuning methodology, and present OnlineVPO, a more efficient preference learning framework tailored specifically for VDMs. On the feedback source, we found that the image-level reward model commonly used in existing methods fails to provide a human-aligned video preference signal due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Advanced Image and Video Retrieval Techniques
MethodsDiffusion · Direct Preference Optimization
