VQ-NeRV: A Vector Quantized Neural Representation for Videos

Yunjie Xu; Xiang Feng; Feiwei Qin; Ruiquan Ge; Yong Peng; Changmiao; Wang

arXiv:2403.12401·cs.CV·March 20, 2024·1 cites

VQ-NeRV: A Vector Quantized Neural Representation for Videos

Yunjie Xu, Xiang Feng, Feiwei Qin, Ruiquan Ge, Yong Peng, Changmiao, Wang

PDF

Open Access 1 Repo

TL;DR

VQ-NeRV introduces a vector quantized neural architecture with a novel codebook mechanism and optimization for improved video compression and reconstruction, outperforming previous methods like HNeRV in quality and efficiency.

Contribution

The paper proposes VQ-NeRV, a U-shaped neural architecture with a codebook for discretizing residual features, enhancing video compression and reconstruction capabilities.

Findings

01

VQ-NeRV achieves 1-2 dB higher PSNR than HNeRV.

02

VQ-NeRV uses less bits per pixel, improving compression.

03

VQ-NeRV improves video inpainting results.

Abstract

Implicit neural representations (INR) excel in encoding videos within neural networks, showcasing promise in computer vision tasks like video compression and denoising. INR-based approaches reconstruct video frames from content-agnostic embeddings, which hampers their efficacy in video frame regression and restricts their generalization ability for video interpolation. To address these deficiencies, Hybrid Neural Representation for Videos (HNeRV) was introduced with content-adaptive embeddings. Nevertheless, HNeRV's compression ratios remain relatively low, attributable to an oversight in leveraging the network's shallow features and inter-frame residual information. In this work, we introduce an advanced U-shaped architecture, Vector Quantized-NeRV (VQ-NeRV), which integrates a novel component--the VQ-NeRV Block. This block incorporates a codebook mechanism to discretize the network's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

magicffourier/vq-nerv
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition

MethodsInpainting