Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

Zhengyao Lv; Chenyang Si; Tianlin Pan; Zhaoxi Chen; Kwan-Yee K. Wong; Yu Qiao; Ziwei Liu

arXiv:2506.03123·cs.CV·August 7, 2025

Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

Zhengyao Lv, Chenyang Si, Tianlin Pan, Zhaoxi Chen, Kwan-Yee K. Wong, Yu Qiao, Ziwei Liu

PDF

1 Models

TL;DR

This paper introduces a Dual-Expert Consistency Model that enhances video generation by separately focusing on semantic motion and fine details, achieving high quality with fewer sampling steps.

Contribution

It proposes a novel dual-expert framework and training strategies to improve temporal consistency and visual quality in video diffusion models.

Findings

01

State-of-the-art visual quality achieved

02

Significant reduction in sampling steps

03

Enhanced temporal coherence and detail fidelity

Abstract

Diffusion Models have achieved remarkable results in video synthesis but require iterative denoising steps, leading to substantial computational overhead. Consistency Models have made significant progress in accelerating diffusion models. However, directly applying them to video diffusion models often results in severe degradation of temporal consistency and appearance details. In this paper, by analyzing the training dynamics of Consistency Models, we identify a key conflicting learning dynamics during the distillation process: there is a significant discrepancy in the optimization gradients and loss contributions across different timesteps. This discrepancy prevents the distilled student model from achieving an optimal state, leading to compromised temporal consistency and degraded appearance details. To address this issue, we propose a parameter-efficient \textbf{Dual-Expert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
cszy98/DCM
model· ♡ 9
♡ 9

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.