Estimating the Resize Parameter in End-to-end Learned Image Compression
Li-Heng Chen, Christos G. Bampis, Zhi Li, Luk\'a\v{s} Krasula, and Alan C. Bovik

TL;DR
This paper introduces a differentiable resizing framework with an auxiliary neural network to optimize image size reduction, improving rate-distortion performance in learned image compression models by about 10%.
Contribution
A novel, search-free resizing method using neural networks for adaptive downsampling in learned image compression, enhancing compression efficiency.
Findings
Achieved approximately 10% BD-rate improvement.
Demonstrated favorable subjective image quality.
Provided an open-source implementation for reproducibility.
Abstract
We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models. Our approach is simple: compose a pair of differentiable downsampling/upsampling layers that sandwich a neural compression model. To determine resize factors for different inputs, we utilize another neural network jointly trained with the compression model, with the end goal of minimizing the rate-distortion objective. Our results suggest that "compression friendly" downsampled representations can be quickly determined during encoding by using an auxiliary network and differentiable image warping. By conducting extensive experimental tests on existing deep image compression models, we show results that our new resizing parameter estimation framework can provide Bj{\o}ntegaard-Delta rate (BD-rate) improvement of about 10% against leading…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques
