Boosting neural video codecs by exploiting hierarchical redundancy
Reza Pourreza, Hoang Le, Amir Said, Guillaume Sautiere, Auke Wiggers

TL;DR
This paper introduces lightweight predictors for neural video codecs to capture second-order hierarchical redundancy, significantly improving compression efficiency in RGB and YUV420 colorspaces.
Contribution
It proposes generic motion and residual predictors that learn to extrapolate from previous data, enhancing neural video codecs' rate-distortion performance.
Findings
38% bitrate savings in RGB
34% bitrate savings in YUV420
Effective in capturing second-order redundancy
Abstract
In video compression, coding efficiency is improved by reusing pixels from previously decoded frames via motion and residual compensation. We define two levels of hierarchical redundancy in video frames: 1) first-order: redundancy in pixel space, i.e., similarities in pixel values across neighboring frames, which is effectively captured using motion and residual compensation, 2) second-order: redundancy in motion and residual maps due to smooth motion in natural videos. While most of the existing neural video coding literature addresses first-order redundancy, we tackle the problem of capturing second-order redundancy in neural video codecs via predictors. We introduce generic motion and residual predictors that learn to extrapolate from previously decoded data. These predictors are lightweight, and can be employed with most neural video codecs in order to improve their rate-distortion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Boosting neural video codecs by exploiting hierarchical redundancy· youtube
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis
