Image Coding for Machines with Object Region Learning
Takahiro Shindo, Taiju Watanabe, Kein Yamada, Hiroshi Watanabe

TL;DR
This paper introduces a novel image compression model that learns object regions without needing extra input or task-specific loss, enabling versatile image coding for various recognition models.
Contribution
The proposed model learns object regions directly, eliminating the need for ROI-maps or task-loss, thus supporting multiple image recognition models with a single compression approach.
Findings
Effective across multiple recognition models
Versatile for different datasets
Outperforms previous methods in experiments
Abstract
Compression technology is essential for efficient image transmission and storage. With the rapid advances in deep learning, images are beginning to be used for image recognition as well as for human vision. For this reason, research has been conducted on image coding for image recognition, and this field is called Image Coding for Machines (ICM). There are two main approaches in ICM: the ROI-based approach and the task-loss-based approach. The former approach has the problem of requiring an ROI-map as input in addition to the input image. The latter approach has the problems of difficulty in learning the task-loss, and lack of robustness because the specific image recognition model is used to compute the loss function. To solve these problems, we propose an image compression model that learns object regions. Our model does not require additional information as input, such as an ROI-map,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Medical Image Segmentation Techniques
