Unified ROI-based Image Compression Paradigm with Generalized Gaussian Model
Kai Hu, Junfu Tan, Fang Xu, Ramy Samy, Yu Liu

TL;DR
This paper introduces a unified theoretical framework and a novel Generalized Gaussian Model for ROI-based image compression, significantly improving coding performance and downstream task accuracy by better modeling the distribution of semantic importance regions.
Contribution
It develops a rate-distortion optimization paradigm and proposes GGM with differentiable functions and a dynamic lower bound for stable training, outperforming Gaussian models in ROI coding.
Findings
Achieves state-of-the-art ROI reconstruction results.
Improves downstream task performance such as segmentation and detection.
Provides a more accurate distribution fit than classical models.
Abstract
Region-of-Interest (ROI)-based image compression allocates bits unevenly according to the semantic importance of different regions. Such differentiated coding typically induces a sharp-peaked and heavy-tailed distribution. This distribution characteristic mathematically necessitates a probability model with adaptable shape parameters for accurate description. However, existing methods commonly use a Gaussian model to fit this distribution, resulting in a loss of coding performance. To systematically analyze the impact of this distribution on ROI coding, we develop a unified rate-distortion optimization theoretical paradigm. Building on this paradigm, we propose a novel Generalized Gaussian Model (GGM) to achieve flexible modeling of the latent variables distribution. To support stable optimization of GGM, we introduce effective differentiable functions and further propose a dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Advanced Image Processing Techniques · Video Coding and Compression Technologies
