Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior
Ruoyu Feng, Yunpeng Qi, Jinming Liu, Yixin Gao, Xin Li, Xin Jin, Zhibo Chen

TL;DR
Diff-ICMH introduces a generative image compression framework that harmonizes machine and human vision by ensuring semantic fidelity and perceptual realism, enabling multiple tasks with a single codec and minimal bit rate overhead.
Contribution
The paper proposes Diff-ICMH, a novel generative compression method that combines semantic consistency and perceptual quality, supporting diverse tasks without task-specific tuning.
Findings
Outperforms existing methods in multiple tasks
Maintains high perceptual quality for human viewers
Supports diverse intelligent tasks with a single codec
Abstract
Image compression methods are usually optimized isolatedly for human perception or machine analysis tasks. We reveal fundamental commonalities between these objectives: preserving accurate semantic information is paramount, as it directly dictates the integrity of critical information for intelligent tasks and aids human understanding. Concurrently, enhanced perceptual quality not only improves visual appeal but also, by ensuring realistic image distributions, benefits semantic feature extraction for machine tasks. Based on this insight, we propose Diff-ICMH, a generative image compression framework aiming for harmonizing machine and human vision in image compression. It ensures perceptual realism by leveraging generative priors and simultaneously guarantees semantic fidelity through the incorporation of Semantic Consistency loss (SC loss) during training. Additionally, we introduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Data Compression Techniques · Image and Video Quality Assessment
