A Dual-Critic Reinforcement Learning Framework for Frame-level Bit   Allocation in HEVC/H.265

Yung-Han Ho; Guo-Lun Jin; Yun Liang; Wen-Hsiao Peng; Xiaobo Li

arXiv:2104.01735·cs.CV·April 6, 2021

A Dual-Critic Reinforcement Learning Framework for Frame-level Bit Allocation in HEVC/H.265

Yung-Han Ho, Guo-Lun Jin, Yun Liang, Wen-Hsiao Peng, Xiaobo Li

PDF

Open Access

TL;DR

This paper proposes a dual-critic reinforcement learning framework for more effective frame-level bit allocation in HEVC/H.265, improving rate-distortion performance and rate control over existing methods.

Contribution

It introduces a dual-critic RL approach with separate critics for rate and distortion, enhancing generalization and precision in bit allocation for video coding.

Findings

01

Outperforms x265 in rate-distortion performance

02

Achieves more precise rate control

03

Significantly better than single-critic baseline

Abstract

This paper introduces a dual-critic reinforcement learning (RL) framework to address the problem of frame-level bit allocation in HEVC/H.265. The objective is to minimize the distortion of a group of pictures (GOP) under a rate constraint. Previous RL-based methods tackle such a constrained optimization problem by maximizing a single reward function that often combines a distortion and a rate reward. However, the way how these rewards are combined is usually ad hoc and may not generalize well to various coding conditions and video sequences. To overcome this issue, we adapt the deep deterministic policy gradient (DDPG) reinforcement learning algorithm for use with two critics, with one learning to predict the distortion reward and the other the rate reward. In particular, the distortion critic works to update the agent when the rate constraint is satisfied. By contrast, the rate critic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Coding and Compression Technologies · Image and Video Quality Assessment · Advanced Vision and Imaging

MethodsHigh-Order Consensuses