Stream-T1: Test-Time Scaling for Streaming Video Generation

Yijing Tu; Shaojin Wu; Mengqi Huang; Wenchuan Wang; Yuxin Wang; Chunxiao Liu; Zhendong Mao

arXiv:2605.04461·cs.CV·May 7, 2026

Stream-T1: Test-Time Scaling for Streaming Video Generation

Yijing Tu, Shaojin Wu, Mengqi Huang, Wenchuan Wang, Yuxin Wang, Chunxiao Liu, Zhendong Mao

PDF

1 Repo

TL;DR

Stream-T1 introduces a test-time scaling framework for streaming video generation that reduces computational costs and enhances temporal coherence by leveraging chunk-level synthesis and historical information.

Contribution

It presents a novel TTS framework specifically designed for streaming video, incorporating three units to improve temporal dependency, coherence, and visual quality.

Findings

01

Significantly improves temporal consistency and motion smoothness.

02

Reduces computational overhead compared to existing methods.

03

Achieves superior visual quality on benchmark datasets.

Abstract

While Test-Time Scaling (TTS) offers a promising direction to enhance video generation without the surging costs of training, current test-time video generation methods based on diffusion models suffer from exorbitant candidate exploration costs and lack temporal guidance. To address these structural bottlenecks, we propose shifting the focus to streaming video generation. We identify that its chunk-level synthesis and few denoising steps are intrinsically suited for TTS, significantly lowering computational overhead while enabling fine-grained temporal control. Driven by this insight, we introduced Stream-T1, a pioneering comprehensive TTS framework exclusively tailored for streaming video generation. Specifically, Stream-T1 is composed of three units: (1) Stream -Scaled Noise Propagation, which actively refines the initial latent noise of the generating chunk using historically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

framex-ai/Stream-T1
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.