A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform   (IDCT) Design

Yifan Wang; Zhanxuan Mei; Chia-Yang Tsai; Ioannis Katsavounidis; C.-C.; Jay Kuo

arXiv:2102.00502·cs.MM·February 2, 2021·1 cites

A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

Yifan Wang, Zhanxuan Mei, Chia-Yang Tsai, Ioannis Katsavounidis, C.-C., Jay Kuo

PDF

Open Access

TL;DR

This paper introduces a machine learning-based method to design an optimal inverse DCT that better compensates for quantization errors in lossy image compression, improving image quality over standard JPEG.

Contribution

A novel IDCT kernel is learned using training images to reverse quantization effects, enhancing compression quality beyond traditional methods.

Findings

01

Achieves 0.11-0.30dB gain over standard JPEG

02

Effectively compensates for quantization errors

03

Improves image quality across various quality factors

Abstract

The design of the optimal inverse discrete cosine transform (IDCT) to compensate the quantization error is proposed for effective lossy image compression in this work. The forward and inverse DCTs are designed in pair in current image/video coding standards without taking the quantization effect into account. Yet, the distribution of quantized DCT coefficients deviate from that of original DCT coefficients. This is particularly obvious when the quality factor of JPEG compressed images is small. To address this problem, we first use a set of training images to learn the compound effect of forward DCT, quantization and dequantization in cascade. Then, a new IDCT kernel is learned to reverse the effect of such a pipeline. Experiments are conducted to demonstrate that the advantage of the new method, which has a gain of 0.11-0.30dB over the standard JPEG over a wide range of quality factors.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Enhancement Techniques · Advanced Data Compression Techniques