T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design

Jiachen Li; Qian Long; Jian Zheng; Xiaofeng Gao; Robinson Piramuthu; Wenhu Chen; William Yang Wang

arXiv:2410.05677·cs.CV·September 17, 2025

T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design

Jiachen Li, Qian Long, Jian Zheng, Xiaofeng Gao, Robinson Piramuthu, Wenhu Chen, William Yang Wang

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces T2V-Turbo-v2, a post-training enhancement for text-to-video diffusion models that leverages data, reward feedback, and conditional guidance to improve video quality and motion accuracy, achieving state-of-the-art results.

Contribution

The paper presents a novel post-training distillation method integrating supervision signals and conditional guidance to significantly improve T2V models.

Findings

01

Achieves new state-of-the-art on VBench with a score of 85.13.

02

Effectively improves motion quality in generated videos.

03

Demonstrates the importance of dataset tailoring and reward models.

Abstract

In this paper, we focus on enhancing a diffusion-based text-to-video (T2V) model during the post-training phase by distilling a highly capable consistency model from a pretrained T2V model. Our proposed method, T2V-Turbo-v2, introduces a significant advancement by integrating various supervision signals, including high-quality training data, reward model feedback, and conditional guidance, into the consistency distillation process. Through comprehensive ablation studies, we highlight the crucial importance of tailoring datasets to specific learning objectives and the effectiveness of learning from diverse reward models for enhancing both the visual quality and text-video alignment. Additionally, we highlight the vast design space of conditional guidance strategies, which centers on designing an effective energy function to augment the teacher ODE solver. We demonstrate the potential of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Ji4chenLi/t2v-turbo
pytorch

Models

🤗
jiachenli-ucsb/T2V-Turbo-v2-no-MG
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks

MethodsFocus