Deep Image Compression via End-to-End Learning

Haojie Liu; Tong Chen; Qiu Shen; Tao Yue; Zhan Ma

arXiv:1806.01496·eess.IV·June 6, 2018·21 cites

Deep Image Compression via End-to-End Learning

Haojie Liu, Tong Chen, Qiu Shen, Tao Yue, Zhan Ma

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep CNN-based image compression method that surpasses traditional codecs in visual quality at the same bit rate, using perceptual and adversarial training techniques for improved results.

Contribution

The authors propose a novel end-to-end deep learning framework with perceptual and adversarial losses, along with transfer learning for better rate-distortion optimization, outperforming existing codecs.

Findings

01

Achieves higher MS-SSIM than BPG, WebP, JPEG2000, JPEG

02

Reduces BD-rate by 7.81% on Kodak dataset

03

Reduces BD-rate by 19.1% on ETH Zurich dataset

Abstract

We present a lossy image compression method based on deep convolutional neural networks (CNNs), which outperforms the existing BPG, WebP, JPEG2000 and JPEG as measured via multi-scale structural similarity (MS-SSIM), at the same bit rate. Currently, most of the CNNs based approaches train the network using a L2 loss between the reconstructions and the ground-truths in the pixel domain, which leads to over-smoothing results and visual quality degradation especially at a very low bit rate. Therefore, we improve the subjective quality with the combination of a perception loss and an adversarial loss additionally. To achieve better rate-distortion optimization (RDO), we also introduce an easy-to-hard transfer learning when adding quantization error and rate constraint. Finally, we evaluate our method on public Kodak and the Test Dataset P/M released by the Computer Vision Lab of ETH Zurich,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pkorus/l3ic
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques