Boosting Neural Image Compression for Machines Using Latent Space Masking
Kristian Fischer, Fabian Brand, Andr\'e Kaup

TL;DR
This paper introduces a neural image compression method optimized for machine analysis tasks, using latent space masking and feature-based loss to significantly reduce bitrate without sacrificing analysis accuracy.
Contribution
It proposes LSMnet, a parallel masking network that reduces bitrate by masking unnecessary latent features, and introduces feature-based loss for training without annotations.
Findings
41.4% bitrate savings compared to VVC for Mask R-CNN
Additional 27.3% bitrate savings with latent space masking
Effective training without annotated data using feature-based loss
Abstract
Today, many image coding scenarios do not have a human as final intended user, but rather a machine fulfilling computer vision tasks on the decoded image. Thereby, the primary goal is not to keep visual quality but maintain the task accuracy of the machine for a given bitrate. Due to the tremendous progress of deep neural networks setting benchmarking results, mostly neural networks are employed to solve the analysis tasks at the decoder side. Moreover, neural networks have also found their way into the field of image compression recently. These two developments allow for an end-to-end training of the neural compression network for an analysis network as information sink. Therefore, we first roll out such a training with a task-specific loss to enhance the coding performance of neural compression networks. Compared to the standard VVC, 41.4% of bitrate are saved by this method for Mask…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
MethodsRegion Proposal Network · Convolution · RoIAlign · Softmax · Mask R-CNN
