A Unified End-to-End Framework for Efficient Deep Image Compression
Jiaheng Liu, Guo Lu, Zhihao Hu, Dong Xu

TL;DR
This paper introduces EDIC, a unified deep learning framework for image compression that significantly improves efficiency and performance, outperforming current methods and enhancing deep video compression.
Contribution
The paper proposes a novel end-to-end framework with channel attention, Gaussian mixture entropy modeling, and decoder enhancement, achieving superior compression with reduced computational cost.
Findings
Outperforms state-of-the-art image compression methods
Increases decoding speed by over 150 times
Enhances deep video compression performance
Abstract
Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks. However, the current state-of-the-art learning based image compression methods suffer from the huge computational cost, which limits their capacity for practical applications. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) based on three new technologies, including a channel attention module, a Gaussian mixture model and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning based image compression. To improve the coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. Besides, the Gaussian mixture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Advanced Image Processing Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
