Compact Latent Representation for Image Compression (CLRIC)
Ayman A. Ameen, Thomas Richter, and Andr\'e Kaup

TL;DR
This paper introduces a resource-efficient image compression method that leverages pre-trained latent representations and overfitted functions, achieving high perceptual quality without multiple models for different quality levels.
Contribution
It presents a novel, model-agnostic, resolution-agnostic compression approach using overfitted functions on pre-trained latent spaces, reducing resource requirements.
Findings
Achieves perceptual quality comparable to state-of-the-art models.
Operates with low computational complexity (~25.5 MAC/pixel).
Eliminates the need for multiple models for different quality levels.
Abstract
Current image compression models often require separate models for each quality level, making them resource-intensive in terms of both training and storage. To address these limitations, we propose an innovative approach that utilizes latent variables from pre-existing trained models (such as the Stable Diffusion Variational Autoencoder) for perceptual image compression. Our method eliminates the need for distinct models dedicated to different quality levels. We employ overfitted learnable functions to compress the latent representation from the target model at any desired quality level. These overfitted functions operate in the latent space, ensuring low computational complexity, around MAC/pixel for a forward pass on images with dimensions pixels. This approach efficiently utilizes resources during both training and decoding. Our method achieves comparable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
