Generative Video Compression with One-Dimensional Latent Representation
Zihan Zheng, Zhaoyang Jia, Naifu Xue, Jiahao Li, Bin Li, Zongyu Guo, Xiaoyi Zhang, Zhenghao Chen, Houqiang Li, Yan Lu

TL;DR
This paper introduces GVC1D, a novel generative video compression method using a 1D latent representation that reduces redundancy and improves compression efficiency by leveraging semantic regions and long-term context.
Contribution
GVC1D replaces traditional 2D latent grids with a 1D token-based approach, enabling adaptive attention to semantic regions and efficient long-term context modeling for better compression.
Findings
Achieves 60.4% bitrate reduction under LPIPS
Achieves 68.8% bitrate reduction under DISTS
Outperforms previous video compression methods
Abstract
Recent advancements in generative video codec (GVC) typically encode video into a 2D latent grid and employ high-capacity generative decoders for reconstruction. However, this paradigm still leaves two key challenges in fully exploiting spatial-temporal redundancy: Spatially, the 2D latent grid inevitably preserves intra-frame redundancy due to its rigid structure, where adjacent patches remain highly similar, thereby necessitating a higher bitrate. Temporally, the 2D latent grid is less effective for modeling long-term correlations in a compact and semantically coherent manner, as it hinders the aggregation of common contents across frames. To address these limitations, we introduce Generative Video Compression with One-Dimensional (1D) Latent Representation (GVC1D). GVC1D encodes the video data into extreme compact 1D latent tokens conditioned on both short- and long-term contexts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Data Compression Techniques · Video Coding and Compression Technologies
