Frequency Disentangled Features in Neural Image Compression
Ali Zafari, Atefeh Khoshkhahtinat, Piyush Mehta, Mohammad Saeed, Ebrahimi Saadabadi, Mohammad Akyash, Nasser M. Nasrabadi

TL;DR
This paper introduces a frequency disentanglement approach in neural image compression that improves entropy modeling and reduces bit rates by separating low and high-frequency features, enhancing compression efficiency.
Contribution
It proposes a novel frequency disentanglement method and a Hadamard-based self-attention mechanism to improve neural image compression performance.
Findings
Outperforms hand-engineered codecs.
Surpasses neural codecs with autoregressive models.
Achieves lower bit rates with better quality.
Abstract
The design of a neural image compression network is governed by how well the entropy model matches the true distribution of the latent code. Apart from the model capacity, this ability is indirectly under the effect of how close the relaxed quantization is to the actual hard quantization. Optimizing the parameters of a rate-distortion variational autoencoder (R-D VAE) is ruled by this approximated quantization scheme. In this paper, we propose a feature-level frequency disentanglement to help the relaxed scalar quantization achieve lower bit rates by guiding the high entropy latent features to include most of the low-frequency texture of the image. In addition, to strengthen the de-correlating power of the transformer-based analysis/synthesis transform, an augmented self-attention score calculation based on the Hadamard product is utilized during both encoding and decoding. Channel-wise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Advanced Image Processing Techniques · Advanced Vision and Imaging
