A Dual-Critic Reinforcement Learning Framework for Frame-level Bit Allocation in HEVC/H.265
Yung-Han Ho, Guo-Lun Jin, Yun Liang, Wen-Hsiao Peng, Xiaobo Li

TL;DR
This paper proposes a dual-critic reinforcement learning framework for more effective frame-level bit allocation in HEVC/H.265, improving rate-distortion performance and rate control over existing methods.
Contribution
It introduces a dual-critic RL approach with separate critics for rate and distortion, enhancing generalization and precision in bit allocation for video coding.
Findings
Outperforms x265 in rate-distortion performance
Achieves more precise rate control
Significantly better than single-critic baseline
Abstract
This paper introduces a dual-critic reinforcement learning (RL) framework to address the problem of frame-level bit allocation in HEVC/H.265. The objective is to minimize the distortion of a group of pictures (GOP) under a rate constraint. Previous RL-based methods tackle such a constrained optimization problem by maximizing a single reward function that often combines a distortion and a rate reward. However, the way how these rewards are combined is usually ad hoc and may not generalize well to various coding conditions and video sequences. To overcome this issue, we adapt the deep deterministic policy gradient (DDPG) reinforcement learning algorithm for use with two critics, with one learning to predict the distortion reward and the other the rate reward. In particular, the distortion critic works to update the agent when the rate constraint is satisfied. By contrast, the rate critic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Image and Video Quality Assessment · Advanced Vision and Imaging
MethodsHigh-Order Consensuses
