Enhancing Neural Video Compression of Static Scenes with Positive-Incentive Noise
Cheng Yuan, Zhenyu Jia, Jiawei Shao, Xuelong Li

TL;DR
This paper introduces a novel framework called PIC that improves neural video compression of static scenes by reinterpreting short-term changes as positive noise, achieving near-lossless quality at extremely low bitrates.
Contribution
The proposed PIC framework effectively disentangles transient variations from static backgrounds, enabling highly efficient compression with minimal data transmission.
Findings
Achieves visually lossless reconstruction at 0.009% bitrate
Outperforms baseline by 20.5% in BD rate reduction
Enables robust transmission of static scene videos under low bandwidth
Abstract
Static scene videos, such as surveillance feeds and videotelephony streams, constitute a dominant share of storage consumption and network traffic. However, both traditional standardized codecs and neural video compression (NVC) methods struggle to encode these videos efficiently due to inadequate usage of temporal redundancy and severe distribution gaps between training and test data, respectively. While recent generative compression methods improve perceptual quality, they introduce hallucinated details that are unacceptable in authenticity-critical applications. To overcome these limitations, we propose a positive-incentive camera (PIC) framework for static scene videos, where short-term temporal changes are reinterpreted as positive-incentive noise to facilitate NVC model finetuning. By disentangling transient variations from the persistent background, structured prior information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
