Learned Image Coding for Machines: A Content-Adaptive Approach

Nam Le; Honglei Zhang; Francesco Cricri; Ramin Ghaznavi-Youvalari,; Hamed Rezazadegan Tavakoli; Esa Rahtu

arXiv:2108.09992·eess.IV·October 14, 2021

Learned Image Coding for Machines: A Content-Adaptive Approach

Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari,, Hamed Rezazadegan Tavakoli, Esa Rahtu

PDF

TL;DR

This paper introduces a content-adaptive finetuning method for learned image codecs that enhances compression efficiency for machine-to-machine communication, achieving significant bitrate savings over traditional codecs.

Contribution

It proposes an inference-time finetuning scheme for learned image codecs, improving compression efficiency specifically for machine consumption scenarios.

Findings

01

Average bitrate saving of -3.66% (BD-rate) over pretrained codec.

02

Significant bitrate saving of -9.85% at low bitrates.

03

Achieves -30.54% BD-rate improvement over VVC.

Abstract

Today, according to the Cisco Annual Internet Report (2018-2023), the fastest-growing category of Internet traffic is machine-to-machine communication. In particular, machine-to-machine communication of images and videos represents a new challenge and opens up new perspectives in the context of data compression. One possible solution approach consists of adapting current human-targeted image and video coding standards to the use case of machine consumption. Another approach consists of developing completely new compression paradigms and architectures for machine-to-machine communications. In this paper, we focus on image compression and present an inference-time content-adaptive finetuning scheme that optimizes the latent representation of an end-to-end learned image codec, aimed at improving the compression efficiency for machine-consumption. The conducted experiments show that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.