Immersive Video Compression using Implicit Neural Representations

Ho Man Kwan; Fan Zhang; Andrew Gower; David Bull

arXiv:2402.01596·eess.IV·November 22, 2024·1 cites

Immersive Video Compression using Implicit Neural Representations

Ho Man Kwan, Fan Zhang, Andrew Gower, David Bull

PDF

Open Access 1 Repo

TL;DR

This paper introduces MV-HiNeRV, an innovative INR-based codec for immersive multi-view videos, which effectively exploits redundancies to achieve significant compression gains over existing methods.

Contribution

It extends INR-based video compression to immersive videos by developing MV-HiNeRV, incorporating view-specific feature grids and shared parameters for improved redundancy exploitation.

Findings

01

Achieves up to 72.33% coding gains over TMIV.

02

Effectively exploits spatio-temporal and inter-view redundancies.

03

Outperforms existing immersive video codecs in tests.

Abstract

Recent work on implicit neural representations (INRs) has evidenced their potential for efficiently representing and encoding conventional video content. In this paper we, for the first time, extend their application to immersive (multi-view) videos, by proposing MV-HiNeRV, a new INR-based immersive video codec. MV-HiNeRV is an enhanced version of a state-of-the-art INR-based video codec, HiNeRV, which was developed for single-view video compression. We have modified the model to learn a different group of feature grids for each view, and share the learnt network parameters among all views. This enables the model to effectively exploit the spatio-temporal and the inter-view redundancy that exists within multi-view videos. The proposed codec was used to compress multi-view texture and depth video sequences in the MPEG Immersive Video (MIV) Common Test Conditions, and tested against the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hmkx/mv-hinerv
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image and Signal Denoising Methods