A Practical Approach for Rate-Distortion-Perception Analysis in Learned   Image Compression

Ogun Kirmemis; A. Murat Tekalp

arXiv:2104.14836·eess.IV·May 3, 2021·PCS

A Practical Approach for Rate-Distortion-Perception Analysis in Learned Image Compression

Ogun Kirmemis, A. Murat Tekalp

PDF

TL;DR

This paper introduces a practical method to analyze the trade-off between rate, distortion, and perception in learned image compression by fixing the rate, enabling more meaningful perceptual evaluations beyond traditional metrics like PSNR.

Contribution

It proposes a novel approach to fix the rate in learned image compression, facilitating systematic analysis of the rate-distortion-perception trade-off.

Findings

01

Provides insights into perceptual quality at fixed rates.

02

Enables principled evaluation of learned image compression methods.

03

Highlights limitations of traditional metrics like PSNR.

Abstract

Rate-distortion optimization (RDO) of codecs, where distortion is quantified by the mean-square error, has been a standard practice in image/video compression over the years. RDO serves well for optimization of codec performance for evaluation of the results in terms of PSNR. However, it is well known that the PSNR does not correlate well with perceptual evaluation of images; hence, RDO is not well suited for perceptual optimization of codecs. Recently, rate-distortion-perception trade-off has been formalized by taking the Kullback-Leibner (KL) divergence between the distributions of the original and reconstructed images as a perception measure. Learned image compression methods that simultaneously optimize rate, mean-square loss, VGG loss, and an adversarial loss were proposed. Yet, there exists no easy approach to fix the rate, distortion or perception at a desired level in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDense Connections · Max Pooling · Convolution · Dropout · Softmax