Controlling Rate, Distortion, and Realism: Towards a Single   Comprehensive Neural Image Compression Model

Shoma Iwai; Tomo Miyazaki; Shinichiro Omachi

arXiv:2405.16817·cs.CV·May 28, 2024

Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model

Shoma Iwai, Tomo Miyazaki, Shinichiro Omachi

PDF

1 Repo

TL;DR

This paper introduces a versatile neural image compression model that allows users to control rate, distortion, and realism simultaneously within a single model, outperforming previous methods that required multiple models for different bit rates.

Contribution

The authors develop a novel variable-rate generative NIC model with tailored discriminator designs and a multi-realism technique, enabling ultra-controllable image compression in a single model.

Findings

01

Achieves comparable or better performance than state-of-the-art single-rate models.

02

Supports a wide range of bit rates with one model.

03

Allows user adjustment of rate, distortion, and realism.

Abstract

In recent years, neural network-driven image compression (NIC) has gained significant attention. Some works adopt deep generative models such as GANs and diffusion models to enhance perceptual quality (realism). A critical obstacle of these generative NIC methods is that each model is optimized for a single bit rate. Consequently, multiple models are required to compress images to different bit rates, which is impractical for real-world applications. To tackle this issue, we propose a variable-rate generative NIC model. Specifically, we explore several discriminator designs tailored for the variable-rate approach and introduce a novel adversarial loss. Moreover, by incorporating the newly proposed multi-realism technique, our method allows the users to adjust the bit rate, distortion, and realism with a single model, achieving ultra-controllability. Unlike existing variable-rate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iwa-shi/CRDR
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion