HybridPrompt: Bridging Generative Priors and Traditional Codecs for Mobile Streaming
Liming Liu, Jiangkai Wu, Haoyang Wang, Peiheng Wang, Zongming Guo, Xinggong Zhang

TL;DR
HybridPrompt combines traditional codecs and generative neural models to enable real-time, high-quality 1080p video streaming on mobile devices by optimizing the synergy between the two paradigms.
Contribution
It introduces a hybrid video system that encodes keyframes with generative models and uses traditional codecs for other frames, optimized end-to-end for efficiency and quality.
Findings
Achieves 150 FPS decoding on smartphones.
Outperforms pure neural methods in speed by orders of magnitude.
Improves perceptual quality with an 8% LPIPS gain over traditional codecs.
Abstract
In Video on Demand (VoD) scenarios, traditional codecs are the industry standard due to their high decoding efficiency. However, they suffer from severe quality degradation under low bandwidth conditions. While emerging generative neural codecs offer significantly higher perceptual quality, their reliance on heavy frame-by-frame generation makes real-time playback on mobile devices impractical. We ask: is it possible to combine the blazing-fast speed of traditional standards with the superior visual fidelity of neural approaches? We present HybridPrompt, the first generative-based video system capable of achieving real-time 1080p decoding at over 150 FPS on a commercial smartphone. Specifically, we employ a hybrid architecture that encodes Keyframes using a generative model while relying on traditional codecs for the remaining frames. A major challenge is that the two paradigms have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Generative Adversarial Networks and Image Synthesis · Video Coding and Compression Technologies
