A New Image Codec Paradigm for Human and Machine Uses
Sien Chen, Jian Jin, Lili Meng, Weisi Lin, Zhuo Chen, Tsui-Shan Chang,, Zhengguang Li, Huaxiang Zhang

TL;DR
This paper introduces a novel image codec that efficiently supports both human viewing and machine analysis by combining neural network-based feature extraction, scalable encoding, and residual compression, outperforming traditional codecs in quality and machine vision tasks.
Contribution
It proposes a new image codec paradigm that integrates neural network features and scalable encoding for dual human and machine use, with improved performance over existing codecs.
Findings
Achieves comparable image quality to learning-based codecs
Outperforms traditional codecs in PSNR and MS-SSIM
Enhances object detection and segmentation accuracy
Abstract
With the AI of Things (AIoT) development, a huge amount of visual data, e.g., images and videos, are produced in our daily work and life. These visual data are not only used for human viewing or understanding but also for machine analysis or decision-making, e.g., intelligent surveillance, automated vehicles, and many other smart city applications. To this end, a new image codec paradigm for both human and machine uses is proposed in this work. Firstly, the high-level instance segmentation map and the low-level signal features are extracted with neural networks. Then, the instance segmentation map is further represented as a profile with the proposed 16-bit gray-scale representation. After that, both 16-bit gray-scale profile and signal features are encoded with a lossless codec. Meanwhile, an image predictor is designed and trained to achieve the general-quality image reconstruction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
