End-to-end optimized image compression for machines, a study

Lahiru D. Chamain; Fabien Racap\'e; Jean B\'egaint; Akshay Pushparaja,; Simon Feltman

arXiv:2011.06409·eess.IV·November 13, 2020

End-to-end optimized image compression for machines, a study

Lahiru D. Chamain, Fabien Racap\'e, Jean B\'egaint, Akshay Pushparaja,, Simon Feltman

PDF

TL;DR

This paper presents an end-to-end neural network-based image compression framework optimized specifically for machine analysis tasks, significantly improving accuracy at low bit-rates compared to traditional codecs.

Contribution

It introduces a joint training approach for compression and task networks, enabling rate-accuracy improvements and flexible fine-tuning for remote machine analysis applications.

Findings

01

Joint training improves task accuracy at low bit-rates.

02

Selective fine-tuning achieves better rate-accuracy trade-offs.

03

End-to-end pipelines are flexible for practical use.

Abstract

An increasing share of image and video content is analyzed by machines rather than viewed by humans, and therefore it becomes relevant to optimize codecs for such applications where the analysis is performed remotely. Unfortunately, conventional coding tools are challenging to specialize for machine tasks as they were originally designed for human perception. However, neural network based codecs can be jointly trained end-to-end with any convolutional neural network (CNN)-based task model. In this paper, we propose to study an end-to-end framework enabling efficient image compression for remote machine task analysis, using a chain composed of a compression module and a task algorithm that can be optimized end-to-end. We show that it is possible to significantly improve the task accuracy when fine-tuning jointly the codec and the task networks, especially at low bit-rates. Depending on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.