T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li, Qian Long, Jian Zheng, Xiaofeng Gao, Robinson Piramuthu, Wenhu Chen, William Yang Wang

TL;DR
This paper introduces T2V-Turbo-v2, a post-training enhancement for text-to-video diffusion models that leverages data, reward feedback, and conditional guidance to improve video quality and motion accuracy, achieving state-of-the-art results.
Contribution
The paper presents a novel post-training distillation method integrating supervision signals and conditional guidance to significantly improve T2V models.
Findings
Achieves new state-of-the-art on VBench with a score of 85.13.
Effectively improves motion quality in generated videos.
Demonstrates the importance of dataset tailoring and reward models.
Abstract
In this paper, we focus on enhancing a diffusion-based text-to-video (T2V) model during the post-training phase by distilling a highly capable consistency model from a pretrained T2V model. Our proposed method, T2V-Turbo-v2, introduces a significant advancement by integrating various supervision signals, including high-quality training data, reward model feedback, and conditional guidance, into the consistency distillation process. Through comprehensive ablation studies, we highlight the crucial importance of tailoring datasets to specific learning objectives and the effectiveness of learning from diverse reward models for enhancing both the visual quality and text-video alignment. Additionally, we highlight the vast design space of conditional guidance strategies, which centers on designing an effective energy function to augment the teacher ODE solver. We demonstrate the potential of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks
MethodsFocus
