Parallelized Rate-Distortion Optimized Quantization Using Deep Learning
Dana Kianfar, Auke Wiggers, Amir Said, Reza Pourreza, Taco Cohen

TL;DR
This paper introduces neural network-based methods to accelerate rate-distortion optimized quantization in video coding, achieving significant bit-rate savings with low computational overhead.
Contribution
It presents a neural network approach for fast RDOQ that can be integrated into existing codecs without extra hardware, improving coding efficiency.
Findings
Achieves 1.64% BD-rate savings over scalar quantization.
Reaches 45% of the performance of traditional RDOQ.
Low computational overhead of the proposed neural networks.
Abstract
Rate-Distortion Optimized Quantization (RDOQ) has played an important role in the coding performance of recent video compression standards such as H.264/AVC, H.265/HEVC, VP9 and AV1. This scheme yields significant reductions in bit-rate at the expense of relatively small increases in distortion. Typically, RDOQ algorithms are prohibitively expensive to implement on real-time hardware encoders due to their sequential nature and their need to frequently obtain entropy coding costs. This work addresses this limitation using a neural network-based approach, which learns to trade-off rate and distortion during offline supervised training. As these networks are based solely on standard arithmetic operations that can be executed on existing neural network hardware, no additional area-on-chip needs to be reserved for dedicated RDOQ circuitry. We train two classes of neural networks, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
