PNVC: Towards Practical INR-based Video Compression
Ge Gao, Ho Man Kwan, Fan Zhang, David Bull

TL;DR
PNVC is a practical neural video codec that combines autoencoder and INR methods, achieving significant compression savings and real-time decoding speeds, advancing INR-based video coding towards deployment.
Contribution
Introduces PNVC, a novel INR-based video codec with structural reparameterization, hierarchical quality control, and scale-aware embeddings, outperforming existing INR codecs in efficiency and speed.
Findings
Achieves over 35% BD-rate savings against HEVC HM 18.0.
Maintains 20+ FPS decoding speed for 1080p videos.
Supports both low delay and random access configurations.
Abstract
Neural video compression has recently demonstrated significant potential to compete with conventional video codecs in terms of rate-quality performance. These learned video codecs are however associated with various issues related to decoding complexity (for autoencoder-based methods) and/or system delays (for implicit neural representation (INR) based models), which currently prevent them from being deployed in practical applications. In this paper, targeting a practical neural video codec, we propose a novel INR-based coding framework, PNVC, which innovatively combines autoencoder-based and overfitted solutions. Our approach benefits from several design innovations, including a new structural reparameterization-based architecture, hierarchical quality control, modulation-based entropy modeling, and scale-aware positional embedding. Supporting both low delay (LD) and random access (RA)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Image and Signal Denoising Methods
