Light Field Image Coding Using VVC standard and View Synthesis based on   Dual Discriminator GAN

Nader Bakir; Wassim Hamidouche; Sid Ahmed Fezza; Khouloud Samrouth and; Olivier Deforges

arXiv:2103.04201·eess.IV·March 9, 2021·IEEE Trans. Multim.

Light Field Image Coding Using VVC standard and View Synthesis based on Dual Discriminator GAN

Nader Bakir, Wassim Hamidouche, Sid Ahmed Fezza, Khouloud Samrouth and, Olivier Deforges

PDF

TL;DR

This paper introduces a novel light field image coding method that combines VVC standard encoding, view synthesis with a dual discriminator GAN, and quality enhancement to achieve superior compression efficiency for VR content.

Contribution

It proposes a new LF image coding approach that transmits only key views, synthesizes others with a dual discriminator GAN, and enhances quality for smooth navigation, outperforming existing methods.

Findings

01

Achieves -36.22% BD-BR reduction compared to state-of-the-art.

02

Improves BD-PSNR by 1.35 dB over previous methods.

03

Demonstrates effective view synthesis and quality enhancement for VR applications.

Abstract

Light field (LF) technology is considered as a promising way for providing a high-quality virtual reality (VR) content. However, such an imaging technology produces a large amount of data requiring efficient LF image compression solutions. In this paper, we propose a LF image coding method based on a view synthesis and view quality enhancement techniques. Instead of transmitting all the LF views, only a sparse set of reference views are encoded and transmitted, while the remaining views are synthesized at the decoder side. The transmitted views are encoded using the versatile video coding (VVC) standard and are used as reference views to synthesize the dropped views. The selection of non-reference dropped views is performed using a rate-distortion optimization based on the VVC temporal scalability. The dropped views are reconstructed using the LF dual discriminator GAN (LF-D2GAN) model.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.