Learned Image Compression for Machine Perception
Felipe Codevilla, Jean Gabriel Simard, Ross Goroshin, Chris Pal

TL;DR
This paper introduces a learned image compression method optimized for both human perception and machine vision tasks, achieving smaller representations with maintained or improved performance on segmentation and detection.
Contribution
It develops a novel framework for compressing images that enhances performance on vision tasks while reducing data size, surpassing traditional JPEGs.
Findings
Outperforms JPEG in segmentation and detection accuracy.
Achieves 4-10x smaller representations with minimal performance loss.
Supports training models directly from compressed representations.
Abstract
Recent work has shown that learned image compression strategies can outperform standard hand-crafted compression algorithms that have been developed over decades of intensive research on the rate-distortion trade-off. With growing applications of computer vision, high quality image reconstruction from a compressible representation is often a secondary objective. Compression that ensures high accuracy on computer vision tasks such as image segmentation, classification, and detection therefore has the potential for significant impact across a wide variety of settings. In this work, we develop a framework that produces a compression format suitable for both human perception and machine perception. We show that representations can be learned that simultaneously optimize for compression and performance on core vision tasks. Our approach allows models to be trained directly from compressed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Image Processing Techniques · Sparse and Compressive Sensing Techniques
