Learning Image and Video Compression through Spatial-Temporal Energy   Compaction

Zhengxue Cheng; Heming Sun; Masaru Takeuchi; Jiro Katto

arXiv:1906.09683·eess.IV·July 1, 2019·CVPR·1 cites

Learning Image and Video Compression through Spatial-Temporal Energy Compaction

Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto

PDF

Open Access

TL;DR

This paper introduces a novel learning-based image and video compression method that leverages spatial-temporal energy compaction, resulting in superior performance over traditional standards and existing learning methods, especially at high bit rates.

Contribution

It presents a convolutional autoencoder architecture for image compression and extends it to video by incorporating an interpolation loop and energy-based penalties, optimizing spatial-temporal energy distribution.

Findings

01

Outperforms latest image compression standards with MS-SSIM.

02

Significantly outperforms MPEG-4 in video compression.

03

Competitive with H.264, producing more visually pleasing results.

Abstract

Compression has been an important research topic for many decades, to produce a significant impact on data transmission and storage. Recent advances have shown a great potential of learning image and video compression. Inspired from related works, in this paper, we present an image compression architecture using a convolutional autoencoder, and then generalize image compression to video compression, by adding an interpolation loop into both encoder and decoder sides. Our basic idea is to realize spatial-temporal energy compaction in learning image and video compression. Thereby, we propose to add a spatial energy compaction-based penalty into loss function, to achieve higher image compression performance. Furthermore, based on temporal energy distribution, we propose to select the number of frames in one interpolation loop, adapting to the motion characteristics of video contents.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Advanced Image Processing Techniques · Video Coding and Compression Technologies