Smaller is Better: Generative Models Can Power Short Video Preloading

Liming Liu; Jiangkai Wu; Xinggong Zhang

arXiv:2602.09484·eess.IV·February 11, 2026

Smaller is Better: Generative Models Can Power Short Video Preloading

Liming Liu, Jiangkai Wu, Xinggong Zhang

PDF

Open Access

TL;DR

PromptPream leverages local computation and generative models to prefetch short videos efficiently, significantly reducing bandwidth waste and playback stalls while enhancing user experience.

Contribution

This paper introduces PromptPream, a novel preloading paradigm that uses semantic prompts and generative models to optimize bandwidth and reduce stalls in short video streaming.

Findings

01

Reduces stalls and bandwidth waste by over 31%.

02

Improves Quality of Experience (QoE) by 45%.

03

Employs novel prompt inversion and scheduling algorithms.

Abstract

Preloading is widely used in short video platforms to minimize playback stalls by downloading future content in advance. However, existing strategies face a tradeoff. Aggressive preloading reduces stalls but wastes bandwidth, while conservative strategies save data but increase the risk of playback stalls. This paper presents PromptPream, a computation powered preloading paradigm that breaks this tradeoff by using local computation to reduce bandwidth demand. Instead of transmitting pixel level video chunks, PromptPream sends compact semantic prompts that are decoded into high quality frames using generative models such as Stable Diffusion. We propose three core techniques to enable this paradigm: (1) a gradient based prompt inversion method that compresses frames into small sets of compact token embeddings; (2) a computation aware scheduling strategy that jointly optimizes network and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Video Coding and Compression Technologies · Peer-to-Peer Network Technologies