Convolutional Autoencoders for Lossy Light Field Compression

Svetozar Zarko Valtchev; Jianhong Wu

arXiv:2008.00027·eess.IV·August 4, 2020·1 cites

Convolutional Autoencoders for Lossy Light Field Compression

Svetozar Zarko Valtchev, Jianhong Wu

PDF

Open Access

TL;DR

This paper presents a convolutional autoencoder-based method for lossy compression of 4D Light Field data, achieving high compression ratios with minimal quality loss and real-time encoding and decoding capabilities.

Contribution

It introduces an efficient neural network architecture specifically designed for 4D Light Field compression, achieving a 48.6x compression ratio with high quality metrics.

Findings

01

Achieved 48.6x compression with PSNR of 29.46 dB

02

Real-time encoding and decoding speeds of around 1.6-1.8 seconds

03

Network size is 584MB, suitable for practical use

Abstract

Expansion and reduction of a neural network's width has well known properties in terms of the entropy of the propagating information. When carefully stacked on top of one another, an encoder network and a decoder network produce an autoencoder, often used in compression. Using this architecture, we develop an efficient method of encoding and decoding 4D Light Field data, with a substantial compression factor at a minimal loss in quality. Our best results managed to achieve a compression of 48.6x, with a PSNR of 29.46 dB and a SSIM of 0.8104. Computations of the encoder and decoder can be run in real time, with average computation times of 1.62s and 1.81s respectively, and the entire network occupies a reasonable 584MB by today's storage standards.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · CCD and CMOS Imaging Sensors