GAN-Based Multi-View Video Coding with Spatio-Temporal EPI Reconstruction
Chengdong Lan, Hao Yan, Cheng Luo, Tiesong Zhao

TL;DR
This paper introduces a GAN-based multi-view video coding method that enhances side information reconstruction accuracy and reduces bitrate by leveraging spatio-temporal EPIs and a joint encoder constraint, outperforming existing techniques.
Contribution
It proposes a novel GAN-driven approach utilizing spatio-temporal EPIs and joint encoding constraints for improved multi-view video compression.
Findings
Significant improvement in Rate-Distortion performance.
Effective reduction of side information redundancy.
Enhanced reconstruction quality compared to state-of-the-art methods.
Abstract
The introduction of multiple viewpoints in video scenes inevitably increases the bitrates required for storage and transmission. To reduce bitrates, researchers have developed methods to skip intermediate viewpoints during compression and delivery, and ultimately reconstruct them using Side Information (SI). Typically, depth maps are used to construct SI. However, their methods suffer from inaccuracies in reconstruction and inherently high bitrates. In this paper, we propose a novel multi-view video coding method that leverages the image generation capabilities of Generative Adversarial Network (GAN) to improve the reconstruction accuracy of SI. Additionally, we consider incorporating information from adjacent temporal and spatial viewpoints to further reduce SI redundancy. At the encoder, we construct a spatio-temporal Epipolar Plane Image (EPI) and further utilize a convolutional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Advanced Vision and Imaging
