VQ-NeRV: A Vector Quantized Neural Representation for Videos
Yunjie Xu, Xiang Feng, Feiwei Qin, Ruiquan Ge, Yong Peng, Changmiao, Wang

TL;DR
VQ-NeRV introduces a vector quantized neural architecture with a novel codebook mechanism and optimization for improved video compression and reconstruction, outperforming previous methods like HNeRV in quality and efficiency.
Contribution
The paper proposes VQ-NeRV, a U-shaped neural architecture with a codebook for discretizing residual features, enhancing video compression and reconstruction capabilities.
Findings
VQ-NeRV achieves 1-2 dB higher PSNR than HNeRV.
VQ-NeRV uses less bits per pixel, improving compression.
VQ-NeRV improves video inpainting results.
Abstract
Implicit neural representations (INR) excel in encoding videos within neural networks, showcasing promise in computer vision tasks like video compression and denoising. INR-based approaches reconstruct video frames from content-agnostic embeddings, which hampers their efficacy in video frame regression and restricts their generalization ability for video interpolation. To address these deficiencies, Hybrid Neural Representation for Videos (HNeRV) was introduced with content-adaptive embeddings. Nevertheless, HNeRV's compression ratios remain relatively low, attributable to an oversight in leveraging the network's shallow features and inter-frame residual information. In this work, we introduce an advanced U-shaped architecture, Vector Quantized-NeRV (VQ-NeRV), which integrates a novel component--the VQ-NeRV Block. This block incorporates a codebook mechanism to discretize the network's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
MethodsInpainting
