Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos

X. Feng; J. Zhu; M. Wu; C. Chen; F. Mao; H. Guo; J. Wu; X. Chu; K. Huang

arXiv:2605.18233·cs.CV·May 19, 2026

Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos

X. Feng, J. Zhu, M. Wu, C. Chen, F. Mao, H. Guo, J. Wu, X. Chu, K. Huang

PDF

1 Repo

TL;DR

MIGA is a novel train-free method for long video generation that enhances temporal consistency and reduces training-inference gaps using a two-stage alignment and dual consistency mechanisms.

Contribution

It introduces a two-stage alignment and dual consistency enhancement to improve long video generation without additional training.

Findings

01

Achieves state-of-the-art results on VBench and NarrLV datasets.

02

Effectively mitigates training-inference mismatch and maintains long-term consistency.

03

Demonstrates superior performance over existing train-free long video generation methods.

Abstract

Without incurring significant computational overhead, train-free long video generation aims to enable foundation video generation models to produce longer videos. Frame-level autoregressive frameworks, e.g., FIFO-diffusion, offer the advantage of generating infinitely long videos with constant memory consumption. However, the mismatch between training and inference, coupled with the challenge of maintaining long-term consistency, limits the effective utilization of foundation models. To mitigate these concerns, we propose \textbf{MIGA}, a novel infinite-frame long video generation method. Firstly, we propose an effective two-stage alignment mechanism that mitigates the training-inference gap by reducing the excessive noise span fed to the model. We then introduce an innovative dual consistency enhancement mechanism, where the self-reflection approach corrects early high-noise frames and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://xiaokunfeng.github.io/miga_homepage
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.