NVRC: Neural Video Representation Compression
Ho Man Kwan, Ge Gao, Fan Zhang, Andrew Gower, David Bull

TL;DR
NVRC introduces a novel end-to-end INR-based video compression framework with advanced entropy coding and hierarchical model compression, achieving significant performance gains over standard codecs like VVC VTM.
Contribution
The paper presents the first INR-based video codec optimized end-to-end with new entropy coding and hierarchical model compression techniques.
Findings
Outperforms VVC VTM by 24% in PSNR on UVG dataset.
First INR-based video codec to achieve competitive performance.
End-to-end optimization of INR-based video compression.
Abstract
Recent advances in implicit neural representation (INR)-based video coding have demonstrated its potential to compete with both conventional and other learning-based approaches. With INR methods, a neural network is trained to overfit a video sequence, with its parameters compressed to obtain a compact representation of the video content. However, although promising results have been achieved, the best INR-based methods are still out-performed by the latest standard codecs, such as VVC VTM, partially due to the simple model compression techniques employed. In this paper, rather than focusing on representation architectures as in many existing works, we propose a novel INR-based video compression framework, Neural Video Representation Compression (NVRC), targeting compression of the representation. Based on the novel entropy coding and quantization models proposed, NVRC, for the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Human Pose and Action Recognition
