Explicit Residual-Based Scalable Image Coding for Humans and Machines
Yui Tatsumi, Ziyue Zeng, Hiroshi Watanabe

TL;DR
This paper introduces explicit residual-based scalable image coding methods that improve efficiency and interpretability for both human and machine image consumption, achieving significant compression gains.
Contribution
It proposes two novel residual-based scalable coding methods, FR-ICMH and PR-ICMH, enhancing existing neural network-based image compression frameworks with explicit residual mechanisms.
Findings
PR-ICMH achieves up to 29.57% BD-rate savings.
Methods are applicable to various machine vision tasks.
Enhanced interpretability and flexibility in encoder complexity.
Abstract
Scalable image compression is a technique that progressively reconstructs multiple versions of an image for different requirements. In recent years, images have increasingly been consumed not only by humans but also by image recognition models. This shift has drawn growing attention to scalable image compression methods that serve both machine and human vision (ICMH). Many existing models employ neural network-based codecs, known as learned image compression, and have made significant strides in this field by carefully designing the loss functions. In some cases, however, models are overly reliant on their learning capacity, and their architectural design is not sufficiently considered. In this paper, we enhance the coding efficiency and interpretability of ICMH framework by integrating an explicit residual compression mechanism, which is commonly employed in resolution scalable coding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Image Processing Techniques · Advanced Data Compression Techniques · Medical Image Segmentation Techniques
