JPD-SE: High-Level Semantics for Joint Perception-Distortion Enhancement in Image Compression
Shiyu Duan, Huaijin Chen, Jinwei Gu

TL;DR
This paper introduces a semantic-aware image compression framework that leverages high-level semantics to improve perception quality, rate-distortion performance, and downstream computer vision tasks through a novel joint optimization approach.
Contribution
It proposes a generic framework and training scheme enabling any codec to incorporate high-level semantics for enhanced perception and downstream task performance.
Findings
Semantic-aware codecs outperform traditional codecs in perception quality.
Joint optimization improves rate-distortion-perception trade-offs.
Enhanced downstream computer vision algorithm performance.
Abstract
While humans can effortlessly transform complex visual scenes into simple words and the other way around by leveraging their high-level understanding of the content, conventional or the more recent learned image compression codecs do not seem to utilize the semantic meanings of visual content to their full potential. Moreover, they focus mostly on rate-distortion and tend to underperform in perception quality especially in low bitrate regime, and often disregard the performance of downstream computer vision algorithms, which is a fast-growing consumer group of compressed images in addition to human viewers. In this paper, we (1) present a generic framework that can enable any image codec to leverage high-level semantics and (2) study the joint optimization of perception quality and distortion. Our idea is that given any codec, we utilize high-level semantics to augment the low-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Data Compression Techniques · Image Enhancement Techniques
