Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules
Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto

TL;DR
This paper introduces a novel learned image compression method using discretized Gaussian Mixture Likelihoods and attention modules, achieving state-of-the-art results and comparable PSNR performance to traditional standards like VVC.
Contribution
The paper proposes a more accurate entropy model with Gaussian Mixture Likelihoods and integrates attention modules, significantly improving learned image compression performance.
Findings
Achieves state-of-the-art results on Kodak and high-resolution datasets.
Comparable PSNR performance with VVC standard.
Produces more visually pleasing images when optimized with MS-SSIM.
Abstract
Image compression is a fundamental research field and many well-known compression standards have been developed for many decades. Recently, learned compression methods exhibit a fast development trend with promising results. However, there is still a performance gap between learned compression algorithms and reigning compression standards, especially in terms of widely used PSNR metric. In this paper, we explore the remaining redundancy of recent learned compression algorithms. We have found accurate entropy models for rate estimation largely affect the optimization of network parameters and thus affect the rate-distortion performance. Therefore, in this paper, we propose to use discretized Gaussian Mixture Likelihoods to parameterize the distributions of latent codes, which can achieve a more accurate and flexible entropy model. Besides, we take advantage of recent attention modules…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Learned Image Compression With Discretized Gaussian Mixture Likelihoods and Attention Modules· youtube
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Advanced Image Processing Techniques
